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Abstract 



In this thesis many-body Quantum Monte Carlo (QMC) calculations are presented for the 
ground state and excitation energies of Ge atoms, molecules, and clusters as large as Ge2gH36 

1.2 nm diameter). The hydrogen terminated Ge n H m nanoclusters are of particular in- 
terest since they have optical properties that are different from small molecules and bulk 
Ge, and are viewed as an ideal model for quantum confined semiconductor systems. The 
present QMC results are compared with previous QMC calculations for the corresponding Si 
molecules and clusters. In addition, the QMC gaps for Ge n H m are found to be higher than 
the gaps reported in recent time-dependent density functional studies, by amounts similar 
to that previously found for Si systems. 

For materials containing Ge it is necessary to deal with the issue of correlation between 
the core and valence electrons. The accuracy of QMC for heavy atoms such as Ge (Z=32) is 
limited by the fact that the core-valence interactions cannot be treated at the same many- 
body level as the valence-valence interactions. Typically the core-valence interactions are 
treated at a single-body level via a pseudopotential, but such methods are unsatisfactory 
for Ge which has a shallow, easily polarizable 3d core. Previous work has proposed using 
a Hartree-Fock pseudopotential (relativistic) plus a core-polarization potential (CPP) to 
account for core-valance correlation at a many-body level. 

A major part of this work involves quantifying the effect of core-valence correlation via 
the CPP. The CPP is found to be important for accurate calculations of the total energy, and 
for excitations of atoms and small molecules. However, there are only small changes in the 
lowest optical excitations of larger clusters, which can be understood in terms of the nature 
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of the excitations, mainly the relative Ge s to p atomic orbital character of the excited state. 
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Chapter 1 
Introduction 



The problem of understanding interacting electrons is one of the most important challenges 
in science. Electronic correlations control the properties of everyday matter and determine 
critical properties such as electronic excitations, stability and phases and optical properties 
[29]. 

The subject of this thesis research is an accurate treatment of interacting electrons in 
semiconductor materials such as clusters of silicon and germanium in the nanometer size 
range using a combination of quantum Monte Carlo (QMC) many-body methods coupled 
with analytic analysis of the critical issues. Quantum Monte Carlo is ideal because it can be 
applied to a variety of interacting electron systems and explicitly treats correlation [30, 31, 
7, 8]. It is the only known method that can be applied to accurately treat systems of many 
electrons. 

The main focus of the thesis will be on germanium, where the influence of relativity and 
core-relaxation effects play an integral role in the electronic structure [32, 9]. In addition, Ge 
is a test case for an important problem, conceptually and practically: the issue of correlation 
between core and valence electrons. This is not explicitly included in single-body methods 
and is difficult to include in many-body methods because of the large difference in the 
energies and spatial extent of core and valence electrons. We have chosen Ge since the effect 
is largest for shallow, polarizable cores in atoms such as Ge. 

As was already alluded to, the proper treatment of the core-electrons for Ge represents 
a difficulty. Heavy atoms with large cores such as Ge [Z = 32) pose a problem for QMC 
because the computational effort dramatically increases with the atomic number Z [33, 7]. 
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Calculations are greatly simplified by removing the core-electrons and replacing them with 
an effective-core pseudopotential or pseudo-Hamilitonians [34, 35]; 1 this thesis deals exclu- 
sively with pseudopotentials. Typically pseudopotentials are constructed within single-body 
methods such as density functional theory (DFT) in the widely used local density approxi- 
mation (DFT-LDA) or Hartree-Fock (HF), which means that the core- valence interactions 
of any subsequent QMC calculation on the valence electrons are treated at the same level. 
While this approach is sufficient for most elements in the periodic table, where core-valence 
interactions are negligible, it is unsatisfactory for post-transition elements such as Ge which 
has a shallow and easily polarizable 3d core. It is well known that DFT-LDA underestimates 
the gaps for excitations, whereas HF overestimates the gaps [9, 37, 38, 29]. In fact, Ge is a 
well known case where the band gap is zero in the DFT-LDA. 2 

Core-Polarization potentials [40, 41, 42] (CPP)s represent an improved method to treat 
the core-valence interactions over mean-field methods such as DFT-LDA and HF. These po- 
tentials include dynamical effects of the core dipoles interacting with valence electrons and 
can be included in all-electron and valence-only calculations. Shirley and Martin (SM) have 
developed an ah initio CPP formalism that corrects the HF treatment of core-valence inter- 
actions [43]. Their parameterized CPPs are fit to quasiparticle self-energies, which include 
many-body effects such as core-relaxation and polarization and core-valence correlation. 

QMC calculations for various elements have been performed including CPPs [44, 45, 46, 
47, 48, 26], as discussed in Section 7.7. Previous QMC calculations of Ge [49, 50, 51] have not 
used a CPP and treated the core-valence interactions with local [52] LDA pseudopotentials, 
which are known to be not as accurate as non-local pseudopotentials, to treat the core- 
valence interactions. These studies reported results of the total energy the ground-state of 

1 An exception are the all-electron QMC calculations for atoms as large as Xe (Z = 54) by Towler and 
co- workers [36]. That work is a tour de force calculation; it is not fully converged and is proposed as an 
approach that can be feasible since errors in the core may cancel for interesting quantities. This gives support 
to the idea that it is much better to use pseudopotentials provided it can be shown that they are sufficiently 
accurate. 

2 For post-transition elements such as Ge, DFT-LDA overbinds the 4s and 4p electrons, which is believed 
to be responsible for underestimating the band gap minimum [9, 39]. 
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the Ge pseudoatom and bands of the solid. 

There has been much interest recently in clusters of semiconductors, such as Si and 
Ge, consisting of tens to hundreds of atoms. Such nanoclusters have properties that can 
be completely different from small molecules and from bulk materials. In particular, their 
optical properties have been the focus of intense experimental and theoretical interest due 
to the observation of visible photoluminescence and their compatibility with existing Si 
based technology. For Ge nanoclusters, experiments have observed evidence of quantum 
confinement effects [22, 53, 54, 55] and size dependant photoluminescence in the near- infrared 
region, which could possibly be due to a radiative recombination of excitons confined in the 
nanoclusters [22]. 

The interest in these clusters has led to many theoretical investigations using a variety 
of methods to predict the effects of quantum confinement on the energy gaps and optical 
properties. Most of the previous theoretical work has been on Si clusters, including studies 
using time-dependent density functional theory (TDDFT) [24, 26, 2, 56, 1] and many-body 
"GW-BSE" [25, 57, 56] and quantum Monte Carlo [26, 2] methods. Theoretical studies 
for Ge have been much more limited; to our knowledge the only published calculations are 
ones using tight-binding [58], empirical pseudopotentials [59, 60], density functional theory 
(DFT) in the local density approximation (LDA) [61, 62], a combination of DFT in the 
LDA or local spin density approximation (LSDA) with delta self-consistent field theory 
("ASCF") [63, 64, 65] and TDDFT [1, 28]. The qualitative trend of all of these reports is a 
significant increase in the optical gap as the cluster diameter decreases; however, there are 
disagreements between the theoretical results. 

A major component of this thesis is dedicated to many-body calculations of Ge clusters 
using the quantum Monte Carlo (QMC) method. To our knowledge there have been no 
previous QMC calculations for Ge clusters. 3 Results are also reported for atoms and 

3 During the course of this work it came to our attention that a group at Lawrence Livermore National 
Laboratory (Andrew Williamson, Aaron Puzder and Giulia Galli) were also doing QMC calculations of 
the optical gaps of Ge clusters, which closely parallels their work for Si clusters in Ref. [2]. Their results 
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molecules that test the accuracy of the methods. The effect of core-valence correlation 
via the CPP is quantified within QMC for total energies and excitation energies. In this 
thesis only ideal, hydrogen passivated structures are considered. These are the most relevant 
clusters for the first theoretical study of Ge clusters using QMC, since these are well defined 
structures and comparisons can be made with previous results for Si clusters having the 
same structures. However, it should be emphasized that for comparison to experiment, the 
actual structures are not known, and quantum confinement in ideal structures is not the only 
mechanism that can affect the optical properties of the nanoclusters. Factors such as the 
structure and surface chemistry, including surface terminations and reconstructions, have 
been shown to be important for Si clusters [66, 67, 68, 69, 70]; however, such effects are not 
considered in the present work. 

1.1 Thesis Organization 

• This chapter serves as an introduction and includes a brief description of certain fun- 
damental concepts that are used throughout this thesis. 

• Chapter 2 discusses various single-body methods such as Hartree-Fock (HF), Density 
Functional Theory (DFT) and Time-Dependant DFT (TDDFT). Chapter 3 discusses 
various many-body methods such as Configuration Interaction (CI) and the GW ap- 
proximation, including GW combined with a solution to the Bethe-Saltpeter equation 
(GW-BSE). 

• Chapters 4, 6 and 5 discuss the quantum Monte Carlo (QMC) method and the de- 
velopment of qmcPACK, an new QMC code. Detailed reviews are provided for the 
Variational Monte Carlo (VMC) and Diffusion Monte Carlo (DMC) methods and the 

for Ge clusters are not published and have not been shared with us or the general public at the time of 
the submission of this thesis. Wc have discussed our work with them and they have suggested that our 
independently calculated results arc in reasonable agreement with their results. 
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Fixed-Node (FN) approximation for DMC. Chapter 6 describes the many-body trial 
wave function used in QMC, and the specific form of the trial wave function that is 
used for various Ge systems. 

• Chapter 7 discusses the concept of core- valence partitioning and the need for pseudopo- 
tentials and core-polarization potentials. The specific pseudopotential that is used in 
place of the Ge atomic core is introduced. In addition, a method of evaluating non- 
local pseudopotentials (NLPP)s in QMC is reviewed, and discussion is provided for 
the consequences of using NLPPs within DMC. 

• Chapters 8 and 9 quantify the effect of the core-valence correlation on the excitations 
of atomic Ge (Chapter 8) and the small molecules GeH 4 and Ge 2 H 6 (Chapter 9) within 
QMC. This work is crucial before we move on to our ultimate goal of determining the 
optical gaps of the hydrogenated Ge nanoclusters Ge n H m , for (n + m) < 65 (Chap- 
ter 10). 

• In Chapter 10 results for the optical gaps are compared to recent TDLDA results and 
DMC results for similar Si n H m structures are presented. A detailed review of previous 
experimental and theoretical work on these systems is also provided. 

1.2 The Born-Oppenheimer Hamiltonian 

The Born-Oppenheimer approximation is based on the assumption that the electronic and 
ionic degrees of freedom of a system can be decoupled. The A^— electron Hamiltonian for 
the electronic degrees of freedom is given by 




(1.1) 
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where rj is the coordinate of electron i and V ex t is an external potential. The last term is the 
Coulomb interaction between the electrons. The Born-Oppenheimer Hamiltonian is used 
throughout this thesis. 



1.3 Exchange and Correlation 

The wave function of any electronic system must be antisymmetric under the exchange of two 
electrons due to the exclusion principle for fermions. A consequence of the antisymmetric 
nature of the wave function is that there is a spatial separation between same spin electrons 
which reduces the Coulomb energy. This phenomenon is known as the exchange interaction; 
exchange is treated exactly in the Hartree-Fock approximation, see Section 2.3. It is possible 
to further lower the Coulomb energy by spatially separating opposite spin electrons, but this 
must be balanced against any increase in the kinetic energy. This phenomenon is known as 
correlation, by definition the correlation energy is the difference of the exact energy and the 
Hartree-Fock energy 



The correlation energy is always negative since the Hartree-Fock energy is variational [71]. 

1.4 A Brief Description of Electronic Excitations 

Electronic excitations can be classified into two types: (i) single particle excitations in which 
one electron is removed N — > N— 1 or added N — > N+l to a system, (ii) particle conserving 
or neutral excitations N — > N. The former correspond to electron addition and removals and 
can be probed experimentally by photoemission or inverse photoemission. Optical properties 
are determined by the latter class of excitations. Optical excitations are viewed as electron- 
hole excitations, where an added electron interacts with the hole left by removing electron. 



Ecorr — E Exact — E HF . 
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1.4.1 Quantum Confinement in Semiconductor Nanoclusters 

For semiconductor nanoclusters quantum confinement, caused by the restricted (nanomet- 
ric) size of the nanoclusters, can result in superior properties for device applications when 
compared to those of bulk materials [72] . For Si and Ge nanoclusters, the quantum confine- 
ment model predicts a shift in the energies of the valence band and the conduction band 
edges relative to the bands of bulk; leading to an increased energy gap relative to their bulk 
counterparts. 

One of the most interesting applications of quantum confinement is the ability to manip- 
ulate and tune desirable optical properties such as the fundamental absorption and photolu- 
minescence by changing the system size and shape. Size dependant photoluminescence has 
been observed experimentally in Ge [22] and Si [73, 74] nanoclusters. In addition, a large 
amount of theoretical work is dedicated to predicting the size dependance of optical proper- 
ties (gaps) of Si and Ge nanoclusters. Usually these nanoclusters are modeled as bulk-derived 
structures with the dangling bonds saturated with a passivant such as hydrogen. 

Visible photoluminescence has been observed from both Si and Ge nanoclusters produced 
by a variety of techniques 4 [75, 76, 77, 78, 74, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89], 
a phenomenon not observed at the macroscopic length scale (bulk limit) of either of these 
materials. (Although there is some controversy over the mechanism responsible for the visible 
photoluminescence; the role of quantum confinement versus the role of defects). It is not 
difficult to imagine that visible photoluminescence has a tremendous number of potential 
applications. 

1.4.2 Absorption and Emission in Nanoclusters 

Optical absorption is the process by which a system is excited from its electronic ground 

state to an excited state by absorbing light. The absorption of a photon leads to the creation 
4 The most common technique it to grow the nanocrystals in a glass (Si02) matrix 
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of an electron-hole pair (exciton) . 

When a nanocluster is optically excited to create an electron-hole pair, the final state 
has approximately the same atomic configuration as the initial state. Hence the excitation 
is designated as being vertical. The pair-excitation energy is the total energy difference 
between the ground state energy and the energy of the system with the electron-hole pair 
[21, 65]: 

E A = E(N;e + h) - E(N). (1.3) 

Eq. (1.3) accounts for quasiparticle effects, i.e. the reaction of the system to the presence of 
an extra electron or hole, and the attractive electron-hole Coulomb interaction. The lowest 
pair-excitation energy is describes the onset of optical absorption. 

After the optical excitation and prior to emission (elecron-hole recombination), the co- 
ordinates of the constituent atoms relax due to the change of the electronic density, see 
Figure 1.1. The atoms reach a minimum energy configuration in the presence of the electron- 
hole pair, which is different than the ground state atomic configuration. The change of the 
atomic coordinates results in a change in the spectrum; hence the energy levels involved 
in emission are different than in absorption. This leads to a redshift of the emission lines 
relative to the absorption lines. The emission energy is defined as the total energy difference 

E E = E*(N;e + h) - E*(N), (1.4) 

where (*) designates that the atomic coordinates from the excited system. The Stokes shift 
is identified as the difference between the onset of optical absorption and the lowest emission 
energy: E A — E E . It is assumed that the relaxation of the atoms occurs at a faster rate than 
the electron-hole recombination [21, 90]. 

The calculations reported in this thesis are for vertical excitations that correspond to 
absorption; structural relaxation of the excited state was not performed. Therefore the most 
appropriate experimental comparison is to photoabsorption measurements. Many experi- 
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ments measure the emission spectrum which is redshifted with respect to the absorption 
spectrum. 




Atomic Relaxation 



Figure 1.1: Schematic representation of the excitation process. Position (1): the nanocluster 
is in its electronic ground state and the atomic coordinates of the constituent atoms are in 
their lowest energy configuration. (1) — > (2): the nanocluster undergoes a vertical electronic 
excitation after absorbing a photon. (2) — > (3): the atomic positions relax to the lowest 
energy configuration in the excited state. (3) — > (4): the electron and hole recombine by a 
vertical transition. Figure originally from Ref. [21]. 



At this point it is necessary to introduce atomic units which are used throughout this thesis. 
Atomic units are often more convenient to use instead of the International System (SI) of 



The Schrodinger equation for a single-electron in a Coulomb potential in SI units is 



1.5 Atomic Units 



units. 



' h 2 

2m e 



V 2 




<j) = E(j). 



(1.5) 



9 



To write the Schrodinger equation in dimensionless form make the following change of 
variables, {x,y,z} — > {Ax', \y', Xz'}, where the primed variables are dimensionless. The 
Schrodinger equation, Eq. (1.5), can be rewritten as 



h 2 



7'2 



2m e A 2 



47renAr' 



E(j)'. 



;i.6) 



It is possible to solve for A by the equality 



h 2 



m e \ 2 47re .A 



E a , 



;i-7) 



to yield 



Aire h 2 



2 ' 



Divide E by E a to rewrite the Schrodinger equation in atomic units 



>' = E'd>', 



;i.9) 



where E' = E/E a . This is equivalent to setting h = e 2 /(47reo) = m e = 1. To convert length 
from atomic units to SI units multiply by the Bohr radius ao 



47r e h 2 , . o A 

a = 2 = 0.5291772108(18)A 



;i.io) 



to convert energy to SI units multiply by the Hartree E a : 



E a 



(Aire ) 2 h 2 



27.2113845(23)eV. 



1.11) 



In atomic units the value of the speed of light in a vacuum is the inverse of the fine 
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structure constant a 



1 



a = 7.297352568e - 3 « 1/137.04. 



(1.12) 



c = — 



The value of these physical constants are available at Ref. [91]. 

1.6 Effective Atomic Units 

Effective atomic units are mainly useful for quantum dot calculations. The iV-electron 
quantum dot (QD) Hamiltonian can be written as a sum of single-particle terms and an 
electron-electron Coulomb interaction 



where m* is the effective mass and e* is the dielectric constant for the semiconductor. The 
effective atomic units are characterized by h — e 2 /(47re*) = m* = 1. The effective Bohr 
radius a* and the effective Hartree E* are related to a and E a : 



r N h 2 p 2 N 1 

2m* 1 K ' 47re* ^ ri - r f 

. i=l J «<j J 



(1.13) 





(1.14) 
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Chapter 2 

Theoretical Methods: Single-Body 
Methods 

2.1 Introduction 

This chapter is focused on the mean field methods of Hartree-Fock (HF) and Density Functi- 
nal Theory (DFT) in the Kohn-Sham scheme. These two methods are important because 
they are the starting point for essentially all correlated-electron calculations. As we will see 
in Chapter 6, the Quantum Monte Carlo methods uses the results of either HF or DFT to 
construct a trial wave function. 

In HF exchange is treated exactly, the correlation energy is the difference between the 
HF energy and the exact energy of the many-body system. Hence correlation is neglected in 
HF. On the other hand, in DFT exchange and correlation are treated approximately by an 
exchange- correlation functional. See Section 1.3 for a thorough discussion of exchange and 
correlation. 

DFT in the Kohn-Sham scheme is very successful for describing ground-state proper- 
ties; however, it is far less successful in describing excitations. The Kohn-Sham eigenvalue 
differences are often used to calculate optical excitations without any formal justifications. 
In order to properly describe excitations it is necessary to extend DFT to time-dependent 
systems. The resulting theory is known as time-dependent DFT (TDDFT). 
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2.2 Density Functional Theory 



Density Functional Theory (DFT) [92, 29, 93, 94] is based on the theorems of Hohenberg 
and Kohn (HK) [95]. The first theorem states that for a system of interacting particles, 
the ground-state density no(r) uniquely determines the external potential V ex t up to an 
additive constant: the inverse relation is also true. A corollary of this theorem is that all 
the properties of a system are determined by the ground-state density. This is because 
the density determines the potential, and therefore the Hamiltonian, which determines the 
many-body wave function for all states (ground and excited) from which physical properties 
are determined. The second theorem states that the energy of a system is a functional of 
the density E[n(r)}; the functional has its variational minimum at the exact ground-state 
density. The energy functional is the sum of an interaction term and a term due to the 
external potential 



Fhk\p\ is a universal functional, which has no dependence on the external potential V ex t(r). 

The idea of Kohn and Sham (KS) [96] was to replace the interacting many-body sys- 
tem with an auxiliary independent particle system with the same electron density. The 
Hohenberg-Kohn energy functional and density can be expressed in terms of single particle 
orbitals The form of the energy functional in the Kohn-Sham scheme is 



where T [n] is the kinetic energy of the system of density n without electron-electron inter- 
actions 




(2.1) 



E[n(r)} =T [n(r)] + 




+ E xc [n{v)] + / dv n(r)V ext (r), (2.2) 




(2.3) 
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E xc [n(r)] is the exchange-correlation energy and the density is given by 



n 



(r)=$>*( r )| 2 - 



(2.4) 



This can be interpreted as a mapping of the interacting many-body system to a system of 
non-interacting electrons in an effective potential due to the other electrons; the mapping 
is exact if i£ xc [n] is known exactly. The functional in Eq. (2.2) is minimized, with the 
orthonormality constraint J drip*(r)ipj(r) = 5ij, to produce the set of Kohn-Sham equations 



and V xc (r) is the exchange-correlation potential. More formally, the exchange-correlation 
potential is the variational derivative of the exchange-correlation energy 



The many-body terms are incorporated into the exchange-correlation functional of the den- 
sity. 

It is important to note that the single particle energies {e,} obtained from the Kohn- 
Sham equations do not have any physical meaning; there is also no counterpart in DFT to 
Koopmans' theorem, see Section 2.3.2. 




(2.5) 



where V#(r) is the Hartree potential 




(2.6) 




(2.7) 
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2.2.1 Self- Consistency and the Kohn-Sham Equations 

The Kohn-Sham equations, Eq. 2.5, are solved iteratively. As a first step an initial guess 
is made for the density n, this is the zeroth iteration. The density is used to construct 
the corresponding effective potential: V e ff(r) = V ext (r) + Vh[ti\ + V xc [n\. The Kohn-Sham 
equations are solved for the effective potential to obtain a new density, which leads to a new 
effective potential. This process is repeated until self- consistently. The effective potential 
determined by this method is known as the self-consistent or mean field. 

2.2.2 Approximations to the Exchange- Correlation Functional 

The exchange-correlation energy E xc [n(r)} contains Coulomb correlation effects beyond the 
Hartree approximation (the Vh term) and part of the kinetic energy for the interacting 
system (T — To), where T is the interacting kinetic energy. For real systems E xc [n(r)\ is not 
known: the most widely used approximation for E xc [n(r)] is the local density approximation 
(LDA). The LDA uses exact expressions for the exchange energy and various interpolation 
schemes to numerically exact quantum Monte Carlo (QMC) correlation energies for the 
homogeneous electron gas [97, 98] 



where e x ° m {n) is the exchange-correlation energy per electron of a uniform electron gas of 
density n. [96] The generalization to spin-polarized systems, where the total energy is a func- 
tional of the spin-up rv and spin-down densities n^, is the local spin density approximation 
(LSDA) [93]. The LDA can be applied to inhomogeneous systems, where it has been shown 
to give a qualitatively good description of the ground-state properties. However, it is well 
known that the accuracy of DFT-LDA is limited and almost always overestimates the bind- 
ing energy and underestimates the bond lengths of molecules and solids [93, 29]. In addition, 
one of the faults of LDA and LSDA is that the self-interaction terms are only approximately 




(2.8) 
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canceled, in contrast the self-interaction is exactly canceled in the HF approximation. 

The generalized gradient approximations (GGA) [92, 99, 99, 100, 101], which include 
gradient corrections to the local density, are an improvement to LDA. Exact exchange (EXX) 
[102] is a way to include the correct exchange within the Kohn-Sham DFT formalism. 



2.3 The Hartree-Fock Method 
2.3.1 Introduction and Basic Theory 

The Hartree-Fock method [103, 104, 16, 105, 4, 106, 107, 108, 29] is an independent-particle 
approximation to solve the iV-electron system with the non-relativistic Hamiltonian 

N \ „ N 



,i=l / ijtj 



where V ext is the external potential. For atoms and molecules V ext = —Z/\r — R Q | unless the 
core-electrons are replaced by a pseudopotential. This Hamiltonian can be written in the 
form 

H = H 1 + H 2 , (2.10) 
where H\ is the sum of N single-electron terms 

H x = X>(0, Ki) = -\v 2 t + V ext (r t ) (2.11) 

i 

which consists of the kinetic energy of a single-electron and the interaction of a single-electron 
with the external potential, and H 2 is the sum of N(N — l)/2 two-electron terms 



#2 = 5>(z,j), 9{iJ) = — (2-12) 



which consists of the electron-electron interactions (with = | — r j I ) - 

16 



In Hartree-Fock the electrons are treated as non-interacting with the exception that they 
must obey the fermion exclusion principle; correlations are treated at a Hartree interaction 
level and not in a true many-body sense. The method uses as a trial wave function a single 
Slater determinant of spin-orbitals 



^i(xi) $j(xi) 

^l(x 2 ) ^2(X 2 ) 
^l(Xiv) ^2(xa0 



^jv(xi) 
?Mx 2 ) 

iPn(*n) 



[2.13) 



where Xj = {r^, o"j} represents the space and spin coordinates of the zth electron. This trial 
wave function automatically obeys fermion particle exchange anti-symmetry 



^(xi, . . . , x^ . . . , Xj, . . . , xtv) = -^(x 1; . . . , Xj, . . . , Xi, . . . , Xjv), : (2.14) 

interchanging any row or column of the Slater determinant results in a sign change. The 
spin-orbitals are constrained to be orthonormal 



(^ j ) = S ij (2.15) 

and can be represented as a product of a spatial orbital ^(r) and a spin function (a for 
spin-up or /3 for spin-down). Since the spin-up spin-orbitals are automatically orthogonal 
to the spin-down spin-orbitals, the condition in Eq. 2.15 reduces to the condition that the 
spatial part of the spin-orbitals with the same spin are orthonormal. From Eq. (2.15), it 
follows that \& is normalized, since the spin-orbitals that form the matrix elements 

of the Slater determinant are orthonormal. 

The expectation value of the energy is given by the sum of the expectation values of the 
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single-electron operators and the two-electron operators: 

(H) = {H 1 ) + (H 2 ), (2.16) 

where ( ) denotes the average with respect to The expression for the single-electron 
operators is given by 

(*|fr 1 |*) = ^j i , (2.17) 

i 

where 

Ii = (^(xOI/iil^Xi)) 

= J dv 1 r i {ri)h 1 ^ i {v 1 ). (2.18) 

The expression for the two-electron operators consists of the sum of the direct and exchange 
terms 



The direct term 



2 

* j 



Jij = (V>i(Xi)^j(x 2 )|— | < 0i(Xi)^ i (x 2 )) 

?"12 



= / ^^^(n)^*^)— ^(rO^-Cra), (2.20) 

is the average value of the interaction l/r 12 when electron 1 is in the spin-orbital ipi and 
electron 2 is in the spin-orbital ipj. The exchange term 

Kij = (^t(xi)^(x 2 )| — |^j(xi)^(x 2 )) 

r 12 

= / dndr 2 ^(ri)^(r 2 )— ^-(ri)^(r 2 ), (2.21) 

J r 12 

is the matrix element of the interaction l/r 12 with respect to ^>j(ri)^>j(r 2 ) and ^(ri)^^) 



- electron 1 and electron 2 are interchanged. 



The Hartree energy is commonly written as 



E H 



dridr 2 



n(ri)n(r 2 ) 



(2.22) 



which represents the self-interaction of the density n(r) : this is the reason why the i = j term 
is included in the double summation. The self-interaction in the Hartree term is canceled by 
a corresponding term in the exchange. It is important to note that the exchange term only 
contributes for same spin electrons; this is a direct result of orthogonality of the spin-orbitals. 

Using Eqs. (2.17, 2.18, 2.19, 2.20, 2.21), the expectation value of the Hamiltonian, 
Eq. (2.9), with respect to the Slater determinant trial wave function can be written as: 



The sum over the spin-orbitals runs over all the occupied spin-orbitals, we have not explicitly 
included any occupation coefficients. 

The energy E[^>] is a functional of the Slater determinant, or the constituent spin-orbitals, 
and is variational: Eq < E[^f], where Eq is the exact ground-state energy of the system given 
by the Hamiltonian in Eq. (2.9). The goal is to find the minimum of this functional with 
respect to the spin-orbitals: the spin-orbitals corresponding to this minimum are occupied 
to construct the ground-state of the system. The minimization must be performed with 
the orthonormality constraint of the spin-orbitals in Eq. (2.15); the result being the set of 




^ W ^r^(r)^(rO-r^7T^(r)^(r'). (2.23) 
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equations 



r-iv 2 + y e 4r)W(r) + Y,j dr 



r — r' 



E / rfr V 'W = e^(r), (2.24) 



r — r 



where the sum j is over all the occupied spin-orbitals and is a Lagrange multiplier. A 
more compact form of Eq. (2.24) is given by 



1. 



-y 2 + v ext (r) + v H + v : 



X 



-0i(r) = e^(r) 



(2.25) 



where we have introduced the direct operator 



V„Ur) = I dr l *'' (r ' )l 



r — r 



r\\2 



(2.26) 



and the non-local exchange operator 



, ^(r')^(r') 



(2.27) 



The Hartree-Fock equations can now written in a form analogous to the Schrodinger 
equation with an effective potential 



-^ 2 + Kff 



(2.28) 



where 



v: ff = v ext (r) + n + Vx- 



(2.29) 



It is important to note that V^f explicitly depends on the spin-orbital ^ and is a non-local 
operator, hence the labeling. 
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The HF equations, Eq. (2.24), are solved self-consistently in a process that is very similar 
to the method to solve the Kohn-Sham equations. The HF equations are typically either 
solved in a spin-restricted or spin-unrestricted fashion. For spin-restricted systems, each of 
the spin-orbitals is doubly occupied by one spin-up electron and one spin-down electron. 
The ground-state is constructed by filling the first N/2 spin-orbitals. For spin- unrestricted 
HF, the spin-orbitals are allowed to have an explicit spin dependence; each spin-orbital is 
only singly occupied. Since the spin-unrestricted version of HF has more variational freedom 
than the spin-restricted version, the total energy from an unrestricted HF calculation is lower 
than the total energy of a restricted HF calculation. 

Taking the scalar product of Eq. (2.24) with ^(r) and using Eqs. (2.18, 2.20, 2.21), it 
can be shown that the single particle energy is given by the expression 

e i = I i + Y^[Jij-Kij}- (2-30) 

3 

This relation implies that the total Hartree-Fock energy can also be given by 

i i j 

An important observation is that the total energy is not the sum of the individual eigenen- 
ergies - the electron-electron interaction energies are double counted. 

2.3.2 Koopmans' Theorem, Orbital Relaxation and the Delta 
Self- Consistent Field Method 

Koopmans' theorem states that the eigenvalues from the HF equations can be identified as 
total energy differences. For example, the energy to remove an electron occupying the orbital 
ipi corresponds to the eigenvalue 6j. Koopmans' theorem is based on the assumption that 
the single-particle orbitals are frozen upon adding or removing an electron. In reality the 
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orbitals change when an electron or hole, or an electron-hole pair, is added to a system. This 
phenomenon is known as relaxation. It is possible to include the effects of orbital relaxation 
by the delta self-consistent field (ASCF) method. In the ASCF approximation, addition 
and removal energies are calculated as total energy differences, i.e. the energy difference of 
two SCF calculations. Weissker and co-workers [63, 64, 65] have used the ASCF method to 
calculate the optical properties of the Si n H m and Ge n H m nanoclusters. 

2.4 The Radial Hartree-Fock Equations: Application 
to Atomic Systems 

From Section 2.3.1 the Hartree-Fock equations, Eq. (2.24), can be written in terms of the 
spatial part of the spin-orbitals. For atomic systems, or any system with a spherically sym- 
metric central potential, it is possible to solve the set of Hartree-Fock equations numerically 
on a radial grid by using the Numerov algorithm, see Section A. 5. The integrals for the 
matrix elements of the Hamiltonian are solved by a method discussed in Appendix A. 4. In 
general it is necessary to use a basis, such as Gaussian or Slater type orbials, to solve the 
Hartree-Fock equations. 

The derivation of the radial HF equations closely follows that of Refs. [16, 105]. As part 
of this thesis we have written a numerical HF code [109] for atoms and quantum dot systems 
with a harmonic central potential. The results from this code represent the HF basis set 
limit: the result of a HF calculation with an infinite basis. 

2.4.1 The Spin-Orbitals 

For an atomic system the spin-orbital ip% index % represents the set of four quantum numbers 
(rij, li, rrii, Si). For spherically symmetric external potentials the spatial part of a spin-orbital 
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can be written as a product of a radial function and a spherical harmonic 



^(r) = R nl {r)Y L 



lm\ 



u n i{r) 



Y, 



lm\ 



(2.32) 



The spherical harmonic is characterized by the quantum numbers (l,m) (for a detailed 
discussion of the spherical harmonics see Appendix A.l). In general the radial component 
R n i(r) can be characterized by the quantum numbers (n, /, m, a) - a more detailed discussion 
of this is given later in this section. 

The radial function obeys the normalization condition 



dr r 2 \R nI (r)\ 2 



dr \u n i(r)\ 2 = 1. 



(2.33) 



Applying the new representation of the spin-orbital to the radial Schrodinger equation for a 
potential V(r) 

-iv 2 + V(r) Vw(r) =e nlm ^ nJm (r), (2.34) 
where the Laplacian in spherical coordinates is given by 



„ 2 i d f 2 d\ i d ( . n d \ i d 2 

V 2 = — — r 2 — + — - — ; — — — — sin^- + 



r 2 dr \ dr J r 2 sin 9 09 \ 39 J r 2 sin 2 9 d(p 2 



(2.35) 



leads to (R = u/r, R' = u'/r - u/r 2 , R" = u"/r - 2u'/r + 2u/r i ) 



1(1 + 1) 
2rfH + 2r 2 



V(r) 



u n i{r) = e nl u n i(r). 



(2.36) 



The analytic solution for the eigenvalue for the one-electron atom with the central po- 
tential V(r) = —Z/r is given by the expression 



2n 2 ' 



(2.37) 
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where n is the principal quantum number. The restriction on the angular momentum quan- 
tum number is < I < n. The solution to the radial functions R n i(r) are related to the 
Laguerre polynomials and can be found in Refs [110]. The nodes of the radial functions are 
determined by n — I — 1, with the s(l = 0) states having the most nodes. The nodes are 
intimately related to the orthogonality constraint. 

For closed-shell systems it can be shown that the charge density n(r) has spherical 
symmetry 



n(r 



T^, , M|2 _ "y^"" \u nl (r)Y lm (6,(j))\ 2 _ 7"2(a + l)Mr)f 

j nlmcr n,l 



by use the relation in Eq. (A. 10). Closed-shell means that each spin- up electron is paired 
with a spin-down electron. From this observation it is not difficult to show that the effective 
potential also is spherically symmetric. For a closed-shell system the radial orbitals u(r) only 
depend on the quantum numbers (n,l). For an open-shell system, such as the ground-state 
of the Ge atom, the radial function can depend on all of the quantum numbers (n, I, m, a). 
It is possible to perform a restricted HF calculation where orbitals with the same (n,l,m) 
or (n, I) quantum numbers are forced to be identical. 

2.4.2 The Single-Electron Operators 

The single-electron operators include the Laplacian, Eq. (2.36), and the external potential. 
Evaluating the matrix elements of the single-electrons operators poses a challenge due to 
the second derivative introduced by the Laplacian. The evaluation of the second derivatives 
can be avoided by noting 



dr V>*(r) 



'o 



^V 2 + V ext 



2 



POO 

/ dr #(r) [* - (V: ff - V ext {r))] ^(r), (2.39) 
Jo 
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where e« is the eigenvalue. This allows the total energy to be written as 

E = J2^~Eh-E x . (2.40) 

i 

2.4.3 Two-Electron Operators: the Hartree Term 
The Potential 

The Hartree operator acting on the zth orbital 

j l^(r')| 2 



V H Uv) 



r — r 



^(r), (2.41) 



can be written in terms of the radial functions itj(r) = rRi(r), Eq. (2.32), with help of the 
following two identities: 

1. The multi-pole expansion 

1 °° k r k Air 

11 21 k=0 q=-k > 



where r< is the smaller and r> is the larger of T\ and r-i- 1 
2. The integral relation among the spherical harmonics 



dm 3 , m s(9,<f>)YY h , mi (9,m 2 ,rn 2 (9,<f>) = \l (2 ^ ( 2^+ 1 i| ^ (hmil2m 2 \l 3 m 3 ){h0l 2 0\l 3 0). 

(2.43) 



^ee Eq. (3.70) on pg. 102 of [111] 
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which obeys the following conditions: 



mi + 777,2 = ^3 

h + h + h = even, else 

h + h > h> \h-h\. (2.44) 



Using identities 1. and 2., the Hartree term takes the form 



°° A POO 

j k=0 q=-k J ° 



K^( r ')-ferx 



^(0,0). (2.45) 



The most generic form of the radial integral over the radial functions from the previous 
equation 



roc a; 

/ dr'n^u^ir')-^ (2.46) 
can be written in terms of the function introduced by Slater 2 
yk{nik',njlj/r) _ 1 



r k+l 



r poo 

dr' r' k u* nik (r')u njlj (r f ) + r k / dr' r'^u^ (r')u njlj (r f ) (2.47) 



o 



Now it is possible to write the Hartree term as 



j k=0 q=—k 



2 See Eq. (17-27) on pg. 17 of [105] 
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with the restrictions 



rrij + q = rnj q = 
lj + k + lj = even, else 

lj + k > lj > \lj - k\ => < fc < 2Zj-. (2.48) 



To eliminate any dependence on 9 or perform the integration 



which leads to 



EEE 



4tt ykj njl^rijlj/r) u nik (r) 



. 2fc + 1 

j k=0 q=—k 

(ljmMh m jWjMQ\lp) J dn [ Y k,mi{9, <t>)]*Y kq (9, (f>)Yi i)mi (9, 4>) 

7 fc=0 q=—k 

{Ijmjkq\ljrrij){lj0k0\lj0) (limikq\limi) (Ii0k0\li0) 



with the additional restrictions 



(2.49) 



m>i + q = rrii ==>- g = 
Zj + A; + /j = even, else 

h + k > U > \U - k\ =>• < k < 2k. (2.50) 
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Utilizing the symmetry properties of the Clebsch-Gordan coefficients [112] 



{hmil 2 m2\hm 3 ) 



l 2 - m 2 h -mx\h- m 3 )) 

-l) ll+l2+l3 (l2m 2 hmi\l 3 m 3 ) 

-l) ,1+,a+,8 (Zi - mxh - m 2 \l 3 - m 3 ) 
/ (2/ 3 + 1) 



■1)' 



■1) 



h— m,2 



2l 2 + l 

' (2/ 3 + l) 
2h + l 



■{hmih - m 3 \l 2 - m 2 ) 



(2.51) 
(2.52) 
(2.53) 

(2.54) 



(l 3 — m 3 l 2 m 2 \h — mi) (2.55) 



and properly taking into account the restrictions, it is now possible to write the final solution 



11 - 



min(2ii,2/j) 

E E (-i)- mi - mj 

j fc=0 



(Z i m^i(-m i )|fcO)(Z i O^O|fcO) 



(2/ i + l)(2/ t + l) 
(2k + l) 2 

ykirijlfrijlj/r) u n%h (r) 



(^mM-m^kOHljOljOlkO) x 

(2.56) 



The Hartree Energy 



The Hartree energy is closely related to the result in Eq. (2.56) 



t,j ^ |r r 1 

= E / dr <«/i( r )^^^( r )> 



)^(r) 



(2.57) 



where the summation includes the i = j self-interaction term. This equation can be simplified 
by use of the identity 3 



1 



Rk(ij;rt)= / drul il .(r)u nr i r (r)-y k (n j l j ;n t lt/r) 



(2.58) 



3 See Eq. (13-19) on pg. 310 of [16] 
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to yield the result 



min(2£;,2£j) 

E » = E E (- 1 

i,j k=0 



(2Z i + l)(2Z i + l) 



(2fc + l) 

(hmik(-mi)\kO) (Ii0li0\k0)7l k (ij; ij) 



(Ijrrijlji-mj) \ kO) {Ij0lj0\ fcO) x 



(2.59) 



It is convenient to rewrite the result for the Hartree energy in terms of the Gaunt coefficients 



2/2 + 1 

c k (limi]l 2 m 2 ) = — — — (/cOZ 2 0|/iO)(fc(mi - m 2 )/2m 2 |Zimi), 
Zl\ + 1 



'2.60) 



such that 



min(2£;,2Zj) 
i,j k=0 



(2.61) 



2.4.4 The Exchange Term 



The Exchange operator acting on the ith orbital 



r — r 



(2.62) 



can be written in terms of the radial functions Ui(r) by following the same steps as outlined 
in deriving the Hartree term. 

Applying the identities in Eqs. (2.42) and (2.43) leads to 



00 k 



Air 



Yl- 



k=0 q=-k 



dr' ' < ]h {r')u nik {r')-^ x 
r > 



]>m .{e\<j>')rY k ^e\^)Y kq {e'A') 



U n Alr) 
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Now substitute the radial integral in Eq. (2.47) to yield 



oo k 



k=0 q=-k 



(2fc+l)(2Zj + l) 
4 7 r(2Z i + 1) 



2& + 1 



{ljmjkq\limi) {Ij0k0\li0) 



with the following restrictions 



rrij + q = rrii ==>- g = m, — m,- 
lj + k + li = even, else 

Zj- + A; > Zi>|Zj-A;| \lj - U\ <k <lj + k 



(2.63) 



To eliminate any dependence on 9 or perform the integration 



cM [Y h , mi (9,<P)Y 



dr 



, ^(r')^(r') 



r — r 



(2.64) 



which leads to 



oo k 



4tt(2Z j + 1) y k (njl s ; Uik/r) u njlj (r) 



(2fc + l)(2Zi + l) r 



x 



r 



j fc=0 q=—k 

(Ijmjkqlhm^iljOkOlhO) J cM (Y hjmi (9, $*W0, </>) 

00 fc 2/ + 1 

E ^ T (^^A;g|Z i m i )(Z,0fc0|Z i 0) x 

j fc=0 q=—k 1 



(kqljinjlliirii) (kOljO\liO) 



yk{njlj\nik/r) u njlj (r) 
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with the additional restrictions 



rrij + q = rrii => q = rrii — rrij 
k + k + lj = even, else 

lj + k > h > \lj - k\ \lj - h\ < k < lj + k. (2.65) 

Utilizing the symmetry properties of the Clebsch-Gordan coefficients, specifically Eq. (2.55), 
and applying the restrictions yields the final result 

= -EC E (2 '' m+lV ^ U-m^mMrnt - m,)? x 

W-0|ifcO) 2 yfc(n ^ nA/r)M ^ (r) . (2.66) 

The Exchange Energy 

The Exchange energy is closely related to the result from Eq. (2.66) 

Ex = -J2 I dv'dv^v)^')— 1— ^.(r)^(r') 
id l r r I 

/"OO 

= -E / ^Jr^^r), (2.67) 
4 ./o 

where the summation includes i = j self-interaction energy. This equation can be simplified 
by use of the identity from Eq. (2.58) to yield 

E X = -£<W, £ {2k l 1} | 2 i + 1} ~ mj)) 2 X 

(/^•O^O) 2 ^^;^) (2.68) 
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In terms of the Gaunt coefficients, Eq. (2.60), the exchange energy has the form 

E x = -J2$<n,* j Yl ^{kmfJjm^KkiiyJi). (2.69) 

3 ^ = \^j I 

2.4.5 Transforming the Non-Local Exchange Operator to a Local 
Operator 

The non-local exchange potential in the Hartree-Fock equations represents a difficulty for 
numerical integration. We would like to transform the non-local exchange operator, if at all 
possible, to yield an effective local operator: 

V x Vx(r). (2.70) 

The advantage of such a transformation is that the efficient Numerov algorithm can be used 
to solve the Hartree-Fock equations. Also, such a transformation facilitates the generation 
of Hartree-Fock pseudopotentials [17]. 

The general form of the radially non-local exchange term is given by 

V x u nih {r) = f dr' ^^fiMryK^UrHkir'), (2.71) 

3 k 

where the index j is summed over all of the occupied orbitals and the non-local fi t j t k{ r , r ') 
correspond to terms in the multi-pole expansion of the Coulomb potential. This equation 
can be modified by multiplying and dividing by it ni ^(r) to yield the local operator 

u n . lk (r). (2.72) 

This effective local operator is used in place of the non-local operator in the radial equations. 
Special care must be taken at the nodes of the radial function to prevent divergences: 
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V x u n . u (r) 



typically a ceiling is applied to the ratio (u n .i 3 (r)/u ni i % (rj). 



2.5 The Scalar- Relativist ic Approximation 

The purpose of this section is to derive and discuss the ideas behind the scalar-relativistic 
approximation. The derivation begins with the Dirac equation for an atom; the Dirac 
equation is the relativistic counterpart for the non-relativistic Schrodinger equation. The 
scalar-relativistic approximation can be incorporated in SCF theories such as HF and DFT. 
We are not discussing the relativistic counterparts of HF (Dirac-Fock) or DFT in this thesis. 

2.5.1 The Dirac Equation 

The Dirac equation (in atomic units) 



i— = (a -1 a ■ p + Pa~ 2 )^ = H^f, 
at 



[2.73) 



is a relativistically covariant equation with positive definite probability density. The quantity 
\l/ is a four- component single-particle wave function which describes spin-1/2 particles and 
p = —i'V is the momentum operator. The matrices cti and f3 obey the following algebra 



ft^ftfc + OL k OLi 

+ (3oti 







(2.74) 
(2.75) 



where I4 is the four-dimensional identity matrix. We will choose the following representation 
for the four Dirac matrices: 



ft; 



a,-,. 



<Ti 







-U 



[2.76) 
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where are the 2x2 Pauli spin matrices and the unit entries of (3 stand for 2 x 2 unit 
matrices. The standard representation for the Pauli matrices are 



^2.77) 





( A 




fo -d 




d 




01 = 








, ^3 = 








v 1 °) 










-v 



Coupling a Dirac particle to an electromagnetic interaction of the form 

A IA = (V(r), 0,0,0) 



(2.78) 



and using the minimal substitution (i.e. — > p M — aA^) reduces the Dirac equation, 
Eq. (2.73), to 



5 



— W = ra _1 a-p + /9a~ 2 + 7(r)l 



^2.79) 



The quantity V(r) is assumed to be a spherically symmetric, i.e. a function of r = |r| 
spin-independent potential. We assume for the wave function the stationary state form 



*(r,t) = e 



-jet 



' 4>(r) ' 



y x( r ) y 



(2.80) 



where 0(r),x(r) G C 2 are time-independent two component spinor describing the spatial 
and spin-1/2 degrees of freedom. For references on the Dirac equation see [113, 114, 115] 



2.5.2 Symmetries of the Motion 

The Dirac Hamiltonian, Eq. (2.73), has rotational and inversion symmetry. The total angular 
momentum operator 



V 



L + \a 
L + h 



J 



(2.81) 
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and the parity operator 



P = e^P, 



[2.82) 



commute with the Hamiltonian. The phase e lr ^ equals ±1 if we require P 2 = 1. The phase 
is related to the intrinsic parity of the particle or state and will be denoted rip. Therefore, 
the eigenfunctions have well defined angular momentum and parity. 

2.5.3 Spin- Orbital Angular Momentum States 

The two-component angular solutions, which are simultaneous eigenfunctions of the opera- 
tors J 2 , J z , and the spin-orbit coupling term ~ L • S, can be constructed from the spherical 
harmonics Yi m (r) and the spin-1/2 states 



Xi i 

2 '2 



vV 



Xi -i 

2' 2 



[2.83) 



The solutions have the form 



(l,m- -\j,m)Y h[ 



f)Xl i + 



(l,m+ -; -,--|j,m)yj )m+ i(f)x|_i, 



'2.84) 



with 



m ± a; 



(2.85) 



Since the quantum numbers (lm') are integers, (jm) are half-integers. From the triangle 
inequalities \l — 1/2 1 <j< |/ + 1/2| , we conclude j = /± |, or equivalently /= j =F 1/2. This 
means that a given value of j corresponds to two different values of I, or a given value of / 
corresponds to two values of j : the exception being the case of / = 0, which has a unique 
value of j = 1/2. 

Using the proper Clebsch-Gordan coefficients, the spin-orbital angular momentum states 
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are: for j — I + 1/2 



C 1 



/ + 1/2 + m 
21 + 1 



K TO _i(f)xi,+ i 



/ + 1/2 - m 

2 / + l W»)**.-* (2.86) 



and for j = / — 1 / 2 



1 l ^ 



/+ 1/2 -m, 
2/ + 1 



!,m- 



; (r)x 



2 ' T 2 



/ + 1/2 + m 
2/ + 1 



r, m+ i(r) X i i. (2.87) 



2.5.4 Eigenvalues of the Spin-Orbital Angular Momentum States 

For the states in Eqs. (2.86, 2.87), it holds 



(2.88) 



and 



^%m(j =F 2 l r ) = =F -, -|r) 



(2.89) 



Furthermore, 



J = (L + S) 2 = L + S* + 2L • S 



(2.90) 



or equivalently, using the identity S = cr/2 



L • <r = J 2 - L 2 



(2.91) 



For j = 1 + 1/2 



L-a<f jm (j- -, -|f) 



J(j + l)-(j-i)(j-^) 



J "2 



^jm(j 



ftm(j - -, -|r) 



-, - r) 

2 ' 2 1 ; 



(2.92) 
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and for j = V -1/2 



L ■ <rip jm (j + -\r) 



3{] + ^)-{3 + \){3-\)-\ 



ftmU + 2' 



ftmU + 2' 2"l r ) 



(2.93) 



2.5.5 Parity and the Operator cr • f 

The effect of the parity operator, Eq. (2.82), on the state of the form 



is given by 



V 



X(r,<) 



/ 



P*(r, t) = n P /5*(r, t) = ¥'(-r, t), 



(2.94) 



(2.95) 



which leads to 



*'(r,t) = n P 



<K-r,t) 



[2.96) 



\ -x(~r,t) J 

For a state with definite parity (i.e. it is unchanged by the transformation ty' = we have 
the condition 



0(r,t) = n P (f)(-r,t) 
= -n P x(-r,t). 



(2.97) 



The upper component of the Dirac wave function has the same spatial parity as that of the 
overall state, while the lower component has the opposite spatial parity. This is due to the 
action of the f3 matrix, which gives an extra phase to the \ spinor. An operator of odd 
parity, which is a linear combination of Yj m (f), can be used to transform one state into the 
other, since their / values differ by 1. 
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The inversion properties of the spherical harmonics state that when r = (r, 9, <p) is re- 
placed by — r = (r, 7r — 9, ir + 0): 



Y lm (ir -9,tt 



-l) l Y lm { 



(2.98) 



For a given j, our spin-orbit angular solutions (fj m (j — |, (fjmO > 



1 |f) are of opposite 

parity ( i.e. their property to change only by a factor of ±1 under spatial inversion) since 



their / values differ by 1. 

The pseudo-scalar cr ■ f has odd parity: 



a ■ r 



rr ■ r. 



(2.99) 



In addition cr • f is Hermitian and when applied to the states (pj m (j =F h, ||r) changes their 
parity It can be shown that in the two-dimensional basis given by (Pj m (j =F |, ||f), cr ■ f has 



the diagonal form 



/ 



cr ■ r 



\ 



-1 

v- 1 v 



The result can be written as 



'2.100) 



a-rip jm {jT 



1 1, 
-<p jm {j ±-,-\r). 



(2.101) 



2.5.6 The Operator a ■ p 

The operator cr ■ p plays an important role in the radial Dirac equation. We want to de- 
termine its action on the functions of the form a(r)<fij m (j =F f |r). Using p = — zV and the 
following identity for the Pauli spin matrices 



(<r-a)(er-b) = a • bl 2 + zcr-(a x b). 



(2.102) 
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It is possible to obtain the relations 



(o-f) 2 = 1 2 



and 



<rp = — (ff-rff-p) 

(T-r) /Id cr L 

z- 



r \ i dr r 

We can now write: for j = / + 1/2 



1 1... (cr-r) (I d (j — |) \ , 1 1 
cr-pa(r> im (j - -,-|r) = I3T + Z — I 



2' 2' 7 r V i dr r 7 rj VJ 2' 2 



r 



. ^ da(r) (j - l) nf ^\ (cr • r) 1 



—i — — a r 



dr r J r 2 



and for j = V - 1/2 



cr-pa(r)^ im (j + -,-|r) = -— - i ^ jm (j + -,-|r 



2' 2' 7 r \idr r J rj ' w 2' 2 

a(r) (j + |) , A (cr ■ r) . 1 

2.5.7 The Radial Dirac Equation 

Using the state in Eq. (2.80) will produce the time-independent Dirac equation 

Hij) = eij) 
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where ip is the two-component spinor 



( , \ 



:2.108) 



In the case of electrons (positive energy solutions) is the large component and \ is the 
small component (i.e by a factor of a). Solving the eigenvalue problem Eq. (2.108) leads to 
a set of coupled equation for </> and % '■ 



a 1 ((T-p)x = (e-V-a 2 )<p 
« _1 (<r-p)0 = (e - V + a~ 2 )x- 



[2.109) 



The coupling is due to the cr ■ p term, which has odd parity and does not couple states with 
different sets of (jm) quantum numbers. The general solution can be characterized by its 
parity (—1)' and quantum numbers (jm): 



1> 



n] m 



( 



\ 



9nj(r)ip jm (j 



l l 

2' 2 I 



/ 



/ 



or 



1 m 



9nj(r)ip jm (j + 2 , 2 



(2.110) 



Note that Eq. (2.110) incorporates the results of Eq. (2.97) by explicitly using cp's with 
opposite parity to describe the upper and lower components. Using Eqs. (2.101) allows 
Eq. (2.110) to be simplified to 



V 



i 1 1 



9nj(r)(Pjm(j 2 , 2 
ifnj(r)<Pjm(j + 1 



\ 



2' 2 I 



or 



/ 



V 



g n j(r)ip jm (j + 
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2' 2 I 



1 ll 
2' 2 I 
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Applying Eq. (2.111) to Eq. (2.109) leads to: for I = j - 1/2 and V = j + 1/2 



/ df n j (j 2 ) 

a ^~dV —f^) = -(e-V-a )g nj 



a 



dr 

and for I = j + 1/2 and V = j - 1/2 : 

-1 (dfnj ] (J + |) 

\ dr r 
-i / ' dg n j {j 2 



(^-V^i) = (^^ + «- 2 )/- ( 2 - 113 ) 



At this point it is advantageous to introduce the quantum number k, which is related to the 
quantum numbers j, I and /' by 

if k = j H — , then I — j -\ — and I' = j =>- K = I, 

2 2 2 

if« = -O'+g)' thenZ = j-- and /' = j + -=►«= -Z - 1, (2.114) 
which allows Eqs. (2.112) and (2.113) to be written concisely as 

+ = (e-^ + a" 2 )/ nK . (2.115) 
\ dr r J 

By defining a new energy origin 

£ = £ -a~ 2 , (2.116) 
the factor multiplying / on the right-hand side can be written as 2M/a 2 , with 

M(r) = l + ?L[e-V{r)]. (2.117) 
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Using Eqs. (2.116) and (2.117) it is now possible to rewrite Eq. (2.115) as 



= a(V-e)g nK + { -^-f nK (2.118) 
dr r 

^ = Jjt±A 9m+ ™ u „ (2 , 19) 

dr r a 



Solving the second equation for / and substituting this into the first equation and carrying 
out the differentiation, we obtain 



1 1 d ( 2 dg nK 

r — : — + 



2M r 2 dr \ dr 
a 2 dV(l + K 



1 «(«+!) 
2M r 2 



a 2 dV dg nh 
AM 2 dr dr 



9nK = eg nK . (2.120) 



AM 2 dr r 

For each of the cases in Eq. (2.114), the factor of k(k + 1) can be written as 

k(k + 1) = 1(1 + 1) (2.121) 

Apart from the last two terms on the left-hand side, Eq. (2.120) looks like a radial Schrodinger 
equation. No approximations have been made yet. The formal mass term M is sometimes 
called the mass-velocity term, the term (dV/dr)(dg/dr) is known as the Darwin term, and 
the last term on the left-hand side of Eq. (2.120) is the spin-orbit coupling term; this is true 
because 

L • (T<p jm (j =F ^, ^|r) = -(1 + K)<p jm (j T \, \\r). (2.122) 
2.5.8 The Koelling-Harmon Approximation 

The essential approximation (Koelling-Harmon) [116] consists of dropping the spin-orbit 
terms from the radial equations, Eq. (2.120). The spin-orbit interaction is separated out 
explicitly and is normally treated variationally or by perturbation theory. The approximate 
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functions are called g and /, and take the form 



1 1 d 



2 dgni 
dr 



V 



1 1(1 + 1) 
2M r 2 



9nl ~ 



a 2 dV dg 



ni 



AM 2 dr dr 



eg n i 



(2.123) 



and 



a dg 



ni 



ni 



2M dr 



The latter is needed for normalization 



(2.124) 



(2.125) 



It should be noted that some versions of the scalar-relativistic approximation ignore the 
small component j n \ [116, 17]. Eq. (2.123) is the scalar-relativistic radial equation, and 
it may be used in place of the Schrodinger equation. Note that the (jik) index has been 
replaced by (nl) since there is no longer any j dependence. 
The spin-orbit coupling term 



H ^ ldV 1 « 

Hs0 -Uf 2 r^ L ' a 



(2.126) 



can be treated as a perturbation: the factor in Eq. (2.126) is largest near the nu- 
cleus. The evaluation of the spin-orbit term with respect to the scalar-relativistic orbitals is 
discussed in Ref. [116]. 



2.5.9 Solving the Radial Equations 

We would like to transform Eq. (2.123) to a form that can be solved by the Numerov 
algorithm, see Appendix A. 5. This is possible by the change of variables: let u(r) = rg(r), 
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with the derivatives u' = g + rg' and u" = 2g' + rg", which yields 



1 d 2 u 
' 2M dr 2 



V 



1 1(1 + 1) a 2 dV 
2M r 2 AM 2 r~dr~ 



u — 



a 2 dV du 
AM 2 dr dr 



eu 



[2.121) 



Now let Z(r) = u(r)/yM, this leads to 



d 2 Z 
dr 2 



1 d 2 u a 2 dV du 



3a 4 



M 1 / 2 dr 2 2M 3 / 2 dr dr 16M 5 / 2 V dr J 



dV\ 2 a 2 d 2 V 

U+ — M, 



4M 3/2 dr 2 



[2.128) 



and finally 



d 2 Z 
dr 2 



1(1 + 1) „, /T , . a 2 3a 4 
V = ; + 2M (V - e) + ttt^— + 



2Mr dr 16M 2 \dr J AM dr 2 



dV\ 2 a 2 d 2 V 



- 



[2.129) 



2.6 Results for Germanium 



This section presents the results from our numerical Hartree-Fock code for atomic Germa- 
nium. Figure 7.1(a) plots the radial orbitals for Ge ( 3 P) ground-state, and Figure 7.1(b) 
shows a close up of the valence orbitals. Table 2.1 compares the eigenvalues for non- 
relativistic and scalar-relativistic HF. 



2.7 Time-Dependent Density Function Theory 

Time-Dependent Density Function Theory (TDDFT) is a generalization of the static Hohenberg- 
Kohn-Sham theory to the time domain. In principle TDDFT can describe the excitation 
spectra exactly [117]; however, TDDFT is plagued by the same problem as static DFT via 
the exchange-correlation functional. The exact functional is not known and the accuracy of 
practical approximations is not established [118]. 

For references on TDDFT see Refs. [117, 118, 29, 119, 120, 121]. In static DFT, the 
ground-state of a system is determined by the minimum of the total energy. For the case 
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Figure 2.1: 7.1(a) The radial orbitals for the ( 3 P) ground-state of the Ge atom from all- 
electron scalar- relativistic Hartree-Fock. 7.1(b) The valence 4s and 4p orbitals. 
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Table 2.1: Comparison of the results of our radial HF code for scalar-relativistic (SR) and 
non-relativistic HF (NR) for the ground-state of the Ge atom ( 3 P). We also compare our 
non-relativistic results to the reference data of Fischer [4] . 



(a) Comparison of the eigenvalues. 



shell 


SR 


NR 


NR Ref. [4] 


Is 


-411.18370 


-405.24423 


-405.24445 


2s 


-53.46802 


-52.15020 


-52.15034 


2p 


-46.56265 


-46.23602 


-46.23616 


3s 


-7.40791 


-7.19093 


-7.19100 


3p 


-5.21133 


-5.16153 


-5.16160 


3d 


-1.59891 


-1.63483 


-1.63490 


As 


-0.56670 


-0.55336 


-0.55336 


4p 


-0.28614 


-0.28735 


-0.28735 



(b) Comparison of the total energy. 





SR 


NR 


NR Ref. [4] 


Total Energy 


-2097.4798 


-2075.3581 


-2075.3597 



of a system under the influence of a time- dependent external-potential, the evolution of the 
system is determined by the extremum of the quantum-mechanical action: 



A= / dt(i/)(t)\ 



to 



\i>(t)). (2.130) 



This is the stationary action principle and is one of the essential theorems of TDDFT by 
Runge and Gross [117]. The second theorem is that the the time- dependent density de- 
termines the external potential uniquely up to an additive time- dependent function. A 
consequence of this one-to-one mapping between the time- dependent potential and the time- 
dependent density is that the expectation value of any quantum mechanical operator is a 
unique functional of the density. In the work of Runge and Gross the external potential 
must be such that it can be expanded as a Taylor series in time [117]. An interesting feature 
of TDDFT is that the density functionals depend on the initial state of the system. 

The quantum mechanical action is a functional of the density, A[n], and has a stationary 
point at the correct time-dependent density. In analogy to the Kohn-Sham approach, it 
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is possible to introduce an auxiliary non-interacting system with the same time-dependent 
density as the interacting system n(r, t). The form of the action functional is given by 

\^i(t))-A H [n]-AJin], (2.131) 

where Ah and A xc are the time-dependent Hartree and exchange-correlation terms. Follow- 
ing a similar approach used in ground-state DFT, it is possible to derive the following set of 
time-dependent Kohn-Sham equations 

H{tmt) = i^, H(t) = - l -V 2 + V eff (v, t), (2.132) 

with the density given by 

n(r,t) = ^|^M)| 2 (2.133) 

i 

and the initial state 

* (to) = (2-134) 
The effective time- dependent potential felt by the electrons is 

V eff = - l -V 2 + V ext (r,t) + J dr' j^j + V xc [n](r,t), (2.135) 

where the time-dependent exchange-correlation functional is the variational derivative of the 
exchange-correlation part of the action functional 

Note that V^ c [n](r, t) is a function of r and t and a functional of the time-dependent density 
n(r',t') [29]. 



fi 



A[n] = J2 I dt(^(t)\ 



4 + 5V - vut) 
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2.7.1 The Adiabatic Local Density Approximation 

The time- dependent exchange-correlation functional V^ c [n](r, t) is of course unknown. The 
most commonly used approximation, largely due to its simplicity, is the Adiabatic Local 
Density Approximation (ALDA). In the ALDA the exchange-correlation functional is the 
static LDA exchange-correlation functional; there is no time (frequency) dependence. This 
can be justified for low frequencies or slowly varying densities [122, 29, 118, 123]. 

2.7.2 Linear Response 

TDDFT can be used to calculate neutral excitations of a system by finding the poles of the 
linear density response function. The interacting linear density response function 



Sn(r, t) 

x - 



6V ext (r>t> 



(2.137) 

v ext =o 



measures the degree to which the density responds to first order in the external potential. It 
is possible to derive a Dyson like equation for \ m terms of the non-interacting Kohn-Sham 
response 

Xo = JlM^ , (2.138) 



where 14// [n] = V ex t+ Vh [n] + V^ c [n]. The linear response of the fictitious Kohn-Sham system 
has poles at the Kohn-Sham eigenvalue differences. The relationship between \ an d Xo is 
given by: 

X{r,r',u) = xo(r,r',o;) + J dr 1 dr 2 Xo(r, r l5 u)K(r 1 , r 2 , u)x(r 2 , r', w) (2.139) 
where the interaction kernel is 

1 5V xc [n(r,t)] 



K(r ll r 2 ,uj) 



|ri — r 2 | <5n(r', t 1 ) 



;2.140) 



v ext =o 
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The first term in the interaction kernel is the Coulomb contribution and the second term 
is the time-dependent exchange-correlation contribution. In the ALDA the time-dependent 
exchange-correlation kernel SV xc [n(r, t)} Sn(r', t') is local in both time and space. 

Eq. (2.139) can be cast in the form of an eigenvalue equation involving pairs of occupied 
and unoccupied Kohn-Sham single-particle states and their respective eigenvalues. 

2.7.3 Explicit Real Time Calculations 

It is possible to carry out real-time calculations by explicitly evolving the single-particle 
orbitals {ipi} [1, 119, 123, 124, 29, 125]. This approach has the advantage over the linear 
response approach (see previous section) in that it is not limited to small perturbations and 
can be used for non-linear effects. In addition, the real-time approach has an advantage for 
larger systems since only the occupied states are propagated. 

From the real-time approach it is possible to calculate the polarizabilty a(u), which is 
one of the most important response functions because it describes the distortion of a charge 
cloud by an external field. The polarizabilty is related to the dipole strength function S(uj) : 



The dipole strength function S(u) is a useful quantity because it is proportional to the exper- 
imentally measured photoabsorbtion cross section a{oS). The relationship between between 
S and a is given by (in SI units): 



In addition, integration of S with respect to the energy gives the number of electrons: 






(2.142) 
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where the fa are the oscillator strengths. 

A way to calculate a(u) is outlined in Refs. [123, 1, 119, 29]. For time t < 0, the 
Hamiltonian H includes a perturbation due to an electric field, AH = E • x, along the 
a;— direction. Normal time-independent DFT is used to solve the Kohn-Sham equations for 
the ground-state of this system. At time t — 0, the electric field is switched off and the 
occupied Kohn-Sham eigenstates are evolved in time by solving Eq. (2.132) for the set of 
discrete time steps ti = iAt. For each time step, the electron density is generated by the new 
set of orbitals, which are then used to calculate the effective Hamiltonian for the next time 
step. The effective Hamiltonian is a function of time since the density n(r, t) is a function 
of time, but for t > there is no external time dependence. 

For every time step the dipole moment D(t) is calculated: 



where e l5t is a damping factor for convergence at large times. To linear order the polariz- 
ability is given by 




(2.144) 



The Fourier transform of D(t) is given by 




(2.145) 



D(u) = a(cu)E(uj) 



(2.146) 



The Fourier transform of E(t) = EQ(—t), where E is a constant, is given by 



E 



(2.147) 



which yields the relation 



(2.148) 
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2.8 Broken Symmetry Solutions 

Typically the mean-field equations occurring in Hartree-Fock or Density Functional Theory 
are solved in a restricted manner. This means that the spin-orbitals are constrained to 
have the same spatial function for the spin-up and spin-down spin functions. Due to the 
exchange interaction between like-spin electrons, each spin-species experiences a different 
effective potential and would not prefer to be described by the same spatial function. For 
high densities, i.e. small values of the Wigner-Seitz parameter r s , the kinetic energy domi- 
nates the electronic structure and restricted solutions typically agree well with those found 
by an unrestricted approach, where the spin-orbitals have different spatial orbitals for the 
different spin functions. For lower densities, where the electron-electron interactions domi- 
nate, unrestricted solutions produce much better ground state energies reflecting a gain in 
correlation. Unfortunately the unrestricted approach has the major drawback of producing 
broken symmetry solutions which are no longer eigenstates of the original Hamiltonian. 

An excellent example of this phenomenon is the disassociation problem for the ground- 
state of the Hydrogen molecule [126]. The Hydrogen molecule disassociates into two non- 
interacting Hydrogen atoms with arbitrary spin orientation. The energy difference between 
the singlet (5=0) and triplet (S—l) states, AE = E s — E t , must approach zero from below 
as the molecule disassociates - the singlet always being the ground state. In the restricted 
Molecular Orbital (MO) approach, the spin-up electron occupies the same molecular orbital 
as the spin-down electron. These orbitals are used to construct a single determinant wave 
function which is an eigenstate of the total spin S 2 and ^-component S z , where S = Si + S2. 
Due to this constraint the restricted MO approach fails to yield the correct (H + H) product 
at large nuclear separation. Using an unrestricted MO approach results in much better 
energies but produces broken symmetry solutions that are no longer eigenstates of the spin. 
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The unrestricted HF wave function has the generalized form 



*(ri,r 2 ) 



1 

V2 



u(n)a(l) v(ri)/3(l) 
u(r 2 )a(2) v(r 2 )(3(2) 



(2.149) 



where u(r) and v(r) are the unrestricted spin- up and spin-down orbitals, respectively. By 
taking a linear combination of Slater determinants of unrestricted MOs, it is possible to re- 
store the symmetry and produce a solution that is a spin eigenstate. The resultant expression 
for the singlet 



* s (ri,r 2 ) oc 



can be reduced to, 



u(n)a(l) u(ri)/3(l) 
u(r 2 )a(2) v{v 2 )(3{2) 



u(n)/3(l) «(n)a(l) 
u(r 2 )/?(2) u(r 2 )a(2) 



(2.150) 



* s (ri,r 2 ) oc (u(ri)u(r 2 ) + u(r 2 )v(ri))x(0, 0), 



(2.151) 



where x(0, 0) = (a(l)/?(2) - a(2)/3(l))/ v / 2 is the spin eigenfunction for the singlet [127, 128, 
129]. This solution has the form of a Heitler-London [130] or valance bond wave function, 
which can be shown to be of lower energy than the original unrestricted solution. 

The important point is that the orbitals u and v are solutions to the mean-field equation 
and not the original many-body Hamiltonian. For this reason mean-field methods give sharp 
transitions, whereas a correct many-body treatment gives a continuous variation. It is well 
known in nuclear physics [131] that a simple wave function, i.e. a product of single-particle 
orbitals, cannot properly describe correlations between the electrons if the proper symmetry 
behavior is simultaneously required. 
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Chapter 3 



Theoretical Methods: Many-Body 
Methods 



3.1 Introduction 

It is well known that single-body methods such as Density Functional Theory (DFT) in the 
widely used Local Density Approximation (LDA) underestimates the gaps for excitations, 
whereas Hartree-Fock (HF) overestimates the gaps [9, 37, 132]. In principle time-dependent 
DFT (TDDFT) can describe the excitation spectra exactly [117]; however, the exact func- 
tional is not known and the accuracy of practical approximations is not established [118]. 
Thus it is desirable to go beyond single-body theories to a many-body theory. 

In this chapter the many-body methods of Configuration Interaction (CI), GW and 
GW combined with a solution to the Bethe-Saltpeter Equation (GW-BSE) are reviewed. 
The Quantum Monte Carlo method is discussed separately in the succeeding chapter, see 
chapter 4. 

3.2 Configuration Interaction 

Exact-diagonalization procedures such as Configuration Interaction (CI) [133] use a linear 
combination of iV-electron Slater determinants to span the Hilbert space of the iV-electron 
system. The form of this expansion is given by 




(3.1) 
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where the Di are the Slater determinants. In principle CI provides an exact answer for not 
only the ground state, but also all of the excited states in the limit of a complete basis. 

CI is an improvement over the Hartree-Fock method, which incorporates exchange by 
using an antisymmetric Slater determinant, but neglects the electron correlations result- 
ing from the Coulomb interaction. In practice the number of iV-electron determinants is 
enormous, usually only single and double excitations are included, and one must restrict 
the basis. Another drawback of this method is that a large number of determinants are 
needed to describe the gradient discontinuities at the cusps where two electrons have the 
same position. 

A common way to create the Di are to use the results of an SCF calculation such as 
Hartree-Fock. An "excited" Slater determinant is created by promoting an electron from an 
occupied orbital to a virtual (excited) orbital. Below we provide a brief summary of some 
of the more common CI methods: 

• Full CI: The determinants in Eq. (3.1) contain all possible excitations. The number of 
excitations is determined by the size of the basis. 

• Singles-Doubles CI (SDCI): Only single and double excitations are allowed. Singles 
only CI is also known as CIS. 

• Mult i- Configuration SCF (MRSCF): Both the CI and SCF coefficients are optimized. 

• Complete Active Space SCF (CASSCF): This is a special case of MRSCF, where a set 
of electrons are designated as being active; only excitations involving the active space 
of electrons are permitted. 

Closely related to CI is the coupled-cluster perturbation technique. A common imple- 
mentation is the coupled-cluster singles- doubles (CCSD) method, which only considers single 
and double excitations. 
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3.3 The GW Approximation 



GW [37, 134, 9, 118, 17, 135, 39, 38] is a many-body method to describe quasiparticle 
excitations. A quasiparticle is the interaction of a bare particle, in this case an electron, and 
a positive screening charge surrounding the particle. For electrons the positive screening 
charge is due to the Coulomb potential; the repulsion between the electrons leads to a 
depletion of negative charge around a given electron. The mathematical description of 
quasiparticles is by the single-electron Green's function. More specifically, the quasiparticle 
energies are the poles of the one-electron Green's function. These energies correspond to 
addition and removal energies in atoms and molecules, and can be used to compute the band 
gaps in solids. 

GW is a more direct method of calculated energy differences than performing separate 
total energy calculations. This is because total energy differences are always less accurate 
than the constituent total energies. This is especially true in QMC method. 

GW is directly relevant to this work since the best known calculations of band gaps in 
Ge and related semiconductors were performed by Shirley, Zhu and Louie [9] using the GW 
approximation. These calculations were performed with a psuedopotential using the CPP 
approach in Section 7.7. Also, the CPPs are fit to GW self- energies for the lowest bound 
valence electron of each angular momentum channel of interest outside a bare core [43]. 

The exact single-electron Green's function is given by 



where \N, 0) is the interacting ground state for the N electron system, T[. . .] is the time- 
ordering operator, $(x) = e' lHt &(r)e~ lHt is the fermion annihilation operator in the Heisen- 
berg representation and $^(x) is the corresponding fermion creation operator [136]. Note 
that in Eq. (3.2) the coordinate x represents the spin and space variables: x = (r,t). For 
t > t' the Green's function describes the propagation of a particle added to the many-body 




(3.2) 
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system, while for t < t' the Green's function describes the propagation of a hole added to 
the many-body system. By the insertion of unity in the given Fock space of the N + 1 and 
N - 1 systems (£\ \N + V,i)(N + and J2j \ N ~ ~ the Green's function 

can be written as 

G = -2^(iV;0|$(x)|iV + l;z)(iV + l;z|<l> t (x / )|iV;0)e(t-t / ) 

i 

+i ^(iV; 0|$ t (x)|iV - l;j)(N - 1; j|$(x')|iV; 0)6 (f - t), (3.3) 

3 

where \N ± l;k( denotes the kth state of the N ± 1 particle system. Introducing the 
quasiparticle wave functions and their corresponding energies 



(3.4) 



<Mr)(iV; 0|$(r)|7V + = E w+M - Ejv.o > p 

^■(r)(iV-l;j|$t(r)|iV;0), E i = E N , - E N _ lti < n 

where /x is the chemical potential, the Green's function becomes 



G = -i &(r)&(0 esxp[iEi(t - f)]Q(t - t') 

i 

i 0j( r )0j( r ') exp[-iEjtf ~ *)]©(*' - <) (3-5) 

j 

The Green's function is given in the frequency (energy) domain a Fourier transform is with 
respect to (t — t') : 

= (3-") 



where the sum n is over all (iV — 1) and (N + 1) particle states and the ±irj terms are 
for convergence. In this representation the quasiparticle energies correspond to poles of the 
Green's function. 

It is important to note that the quasiparticle energies have an actual physical interpre- 
tation, unlike the single-particle eigenvalues from Kohn-Sham DFT which are mathematical 
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constructs, i.e. Lagrange multipliers. The quasiparticles also have finite lifetimes since they 
are not the exact eigenvalues of the many-body Hamiltonian H. The quasiparticle ener- 
gies are often regarded as corrections to the Hartree-Fock eigenvalues which approximately 
describe electron addition and removal energies by Koopmans' theorem, see Section 2.3.2. 

The Green's function is a solution to Dyson's equation. A particular form of Dyson's 
equation (without the spin degrees of freedom) 

-^v 2 + v H + v ext 

where V ex t is the external potential, in the case of atoms V ex t(r) = —Z/r, Vh is the Hartree 
potential 

V H (r) = [ dr' J^lr. (3.8) 
J |r — r | 

and E is the self-energy. The self-energy E is a non- Hermit ian, energy dependent, non-local 
operator that describes exchange and correlation beyond the Hartree (Vh) interaction level. 
In the GW approximation (GWA), E is approximated in terms of the Green's function G 
and the screened interaction W; in the frequency domain E is given by 

/i 
— e^Gtr, r'; E + u)W(r, r'; -u). (3.9) 
27T 

The approximation is the first-order (in W) truncation of an expansion of Hedin's exact 
coupled many-body equations [135]. 

3.4 GW-BSE 

The GW-BSE method refers to the GW approximation, see Section 3.3, combined with a 
solution to the Bethe-Saltpeter Equation (BSE). This method describes the how a system 
reacts to the introduction of two interacting quasiparticles (quasielectron and quasihole) 



dv'^vy-EMr') 



EMv) 



(3.7) 
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into a system such that bound excitons exist. Therefore accurate promotion energies are 
accessible. 

GW-BSE involves solving an equation that appears to be similar to the Dyson equation in 
Section 2.7.2 for TDLDA within linear response. The main difference between TDLDA and 
GW-BSE is the choice of the interaction kernel K and the single-particle states that enter 
the equations. In TDLDA the single-particle states are the DFT-LDA Kohn-Sham single- 
particle states, while in GW-BSE the single-particle states are quasiparticle states. The 
quasiparticle states are obtained from GW and include self-energy effects. Therefore GW- 
BSE determines excitations of the A— electron system by making reference to the (A + 1) 
and (A — 1) system, while TDLDA deals with the A— electron system. The BSE interaction 
kernel couples the quasielectron and quasihole states to describe the exciton [56, 25, 118, 57]. 

The GW-BSE perturbation approach consists of three successive steps. The first step 
is performing a DFT-LDA calculation to obtain the single-particle Kohn-Sham states. As 
a second step the LDA results are used to obtain the self-energy £ within the GW ap- 
proximation, which yields the quasiparticle states. Finally, the quasielectron and quasihole 
states and their respective energies are inserted in the BSE to obtain the optical absorbtion 
spectrum. 

3.5 Quantum Monte Carlo 

For a discussion of Quantum Monte Carlo (QMC) see Chapter 4. For a discussion of many 
of the technical details of QMC, including the form of the many-body trial wave function, 
see Chapter 6. 
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Chapter 4 

Quantum Monte Carlo Methods 



4.1 Introduction to Monte Carlo 

Monte Carlo refers to a broad class of methods use that use random numbers to solve physical 
problems [137, 7, 106]. One of the most important applications of Monte Carlo is to solve 
multi-dimensional integrals of the form 



dR/(R), (4.1) 



where / is assumed to be a smooth function over the interval and R is a <i-dimensional 
coordinate. In quantum Monte Carlo we are interested in computing integrals of the form 

where II is a probability distribution and A is a quantum mechanical operator local in 
position space. The quantity (A) is a quantum mechanical expectation value of the operator 
A with respect to the probability distribution II. 

Typical grid based integration (quadrature) methods evaluate the integral in Eq. (4.1) 
over a set of N points {R,; i — 1, . . . , N} with the associated weights io*: 

1 N 

i=l 

We will assume that the grid is uniform and that the grid-spacing is h. The weights are 
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independent of the integrand / and are determined by a polynomial expansion; the accuracy 
of the expansion determines accuracy of the integration method. In general, the error of the 
integration method can be written in terms of the grid-spacing h as 0[/i fc ], where k > 1. The 
relationship between the grid-spacing and the number of grid-points N for a <i-dimensional 
hypercube with side L is N = (L/h) d . Therefore, the error of standard quadrature methods 
scales with respect to the number of integration points as 0[N~ k ' d \. 

Monte Carlo methods produce an error that scales (converges) as 0[iV _1 / 2 ], independent 
of the dimension of the integral, while using a set of random configurations {R-;} with uniform 
weights Wi — 1. In the language of Monte Carlo a configuration or a walker represents the 
set of variables that describes the state of the system: for a multi-dimensional integral the 
state corresponds to R, the d— dimensional coordinate. The process by which the walkers 
are generated is known as a random walk which will be discussed in more detail later. 

It quickly becomes clear that Monte Carlo has an advantage over quadrature methods 
for higher dimensional integrals. In order to reduce the error of a Monte Carlo integration 
by a factor of two, it is necessary to quadruple the number of walkers (points); this scaling 
oc A r ~ 1 / 2 is far superior than the scaling standard quadrature methods for high dimensional 
integrals oc N~ k / d . In addition, the number of grid points required to evaluate a multi- 
dimensional integral can be prohibitively large. For example, to evaluate a 10-dimensional 
integral with only 10 points per side results in a grid with N = 10 10 points. 

It is imperative to have good random number generators for Monte Carlo simulations. 
There is no such thing as a truly random number generator, instead we use pseudo random 
number generators. Pseudo random number generators are deterministic and have a finite, 
although very large, period. All of the work in this thesis uses pseudo random number 
generators provided by the BOOST [138] and SPRNG [139] libraries. 
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4.2 The Central Limit Theorem 



The Central Limit Theorem (CLT) [140] is the reason why the error of Monte Carlo inte- 
gration scales as OlN^ 1 / 2 ], independent of the dimension of the integral, where N is the 
number of independent configurations used to evaluate (sample) the integral. 

In order to understand the CLT, start with a set of uncorrelated configurations {R,, : % = 
1, . . . ,N} distributed according to a probability distribution function V(R). Define a new 
random variable 



1=1 



and variance 



where Z is a function of mean 

Hz = J dRV(R)Z(R) (4.5) 

a\ = J dR V(R) [Z(R) -fi z } 2 . (4.6) 

The variance measures the fluctuations about the mean. In the limit of an infinite number 
of configurations 

Hz = lim Z N , (4.7) 

N— >oo 

where fiz is the exact mean defined in Eq. (4.5): in practice we would like to estimate the 
error of \ji z — Z^\. 

The CLT states that for large enough N, Z^ is normally distributed about the mean \xz 
with the variance cr|/7V, where <y\ is the variance of the function Z. The variance of the 
mean is the variance of Z^, and not of the underlying probability distribution function from 
which the {R} are drawn. The variance of the mean provides a measure of certainty with 
which Zn is equal to ji z [31, 7]. 

In practice the mean \xz is measured, as is a|; therefore the denominator is reduced by 
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one, leading to the standard formula for the variance of the mean 




(4.8) 



4.3 Importance Sampling 

The most simple implementation of Monte Carlo integration is to homogeneously evaluate 
the integral in Eq. (4.1) by uniformly sampling over the integration volume. This method 
can lead to large statistical fluctuations if a significant contribution to the integral is over 
a small region of the integration volume: only a relatively small number of configurations 
sample the function in this region. Ideally, we would like the majority of configurations to 
sample the region where |/(R)| is large. 

At this time it is convenient to introduce the importance-function g(R) (defined on the 
same domain as /(R)), and rewrite the integral in the form 



The best choice of g(R) is such that the quantity h(R) = /(R)/gf(R) is roughly constant; 
consequentially the fluctuations are greatly diminished. If the importance function has the 
following properties: 

1. g(R) > 

2. Normalization J dHg(R) = 1 

g(R) can be interpreted as a probability distribution function. In this case the integral in 
Eq. (4.9) is the expectation value of /i(R) with respect to the probability distribution g(R) 
with 




(4.9) 




(4.10) 
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Our goal is to come up with a method to correctly sample g(R) in the relevant domain 
to generate a set of configurations {R}, such a method is described in the next section. 
Consequentially, Monte Carlo evaluation of the integral is equivalent to taking the mean of 
the function with respect to the set {R} : 

^^Ew> ( 411 ) 

i=i 

where all the weights are unity since the configurations have a stationary distribution g(R). 
Eq. (4.11) is known as an estimator and the CLT (Section 4.2) determines the error of h^. 

4.4 Markov Chains and the Metropolis Method 

A way to generate a set of configurations {R} with a stationary distribution defined by the 
probability density function V(R) is by a Markov chain [7, 106]. Each new configuration in a 
Markov chain is generated by a probability distribution which depends on the current state 
of the system, independent of time and history of the state, i.e. it does not depend on how 
it got there. The entire set of configurations forms a chain. Markov chains are defined in 
terms of a state-to-state transition probability T(R <— R'). We are interested in T(R <— R') 
with the normalization property 

£r(R<-R/)=l, (4.12) 

R 

where the sum runs over all states of the system. For a process to be Markovian it must 
meet the requirements of ergodicity: all states of the system must be accessible from every 
other state within a finite number of steps and there can be no periodicity [106]. 

The Metropolis method [141] is a recipe to generate a Markov chain for the invariant dis- 
tribution P(R) by a random walk in configuration space. As a motivation for the Metropolis 
method, define the quantity P(R, t), which is the probability of the system to be at the con- 
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figuration R at time t. The rate of change of this probability distribution from t to t + At 
is determined by two processes: 

1. Configurations moving from R to other R', decreasing V(R) 

2. Other configurations moving from R' to R, increasing V(R). 
The mathematical relation for the rate of change of V(R, t) is given by 

P(R,t + At)-P(R,t) = J2T(R <- R')V(R',t) - ^T(R' <- R)V(R,t). (4.13) 

R' R' 

For the equilibrium or steady-state solution we demand that the rate of change of the 
probability is zero "P(R, t + At) = V(R, t), or 

T (R <~ B/)V(R') = T ( R ' <- R)^(R), ( 4 - 14 ) 

R' R' 

where V is now independent of t. A particular solution to this equation is the detailed balance 
condition 

T(R <- R')P(R') = T(R' <- R)P(R), (4.15) 

which relates the pair of configurations R and R'. In other words, the rate of walkers moving 
from R to R' is equal to the rate of walkers moving from R' to R [106]. 

The next step is to split the transition probability into the product of a trial-step proba- 
bility W and an acceptance probability A : 

T(R <— R') = W(R <— R')^4(R <— R'). (4.16) 

The trial-step probability has the normalization property W(R <— R') = 1 and the 
acceptance A is a number between and 1. For the case where the trial-step probability is 
symmetric W(R <— R') = W(R' <— R), the detailed balance condition can now be written 
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as 

A(R <- R') V(R) 



A(R' <- R) P(R') 

with the solution 



(4.17) 



g, if P(R) < P(R') 
^(R^R) = { (4.18) 

1, if V(R) > P(R') 

These two conditions can be concisely written as A(R <— R') = min[l, q\. 

The generalized Metropolis Algorithm is the case where W(R <— R') 7^ W(R' <— R); in 
this case the detailed balance condition can be written as 

A(R - R) _ W(R <- R)V(R) _ ^ 



A(R' <- R) W(R <- R)^(R) 



with the solution 



.A(R <- R) = min[l, g]. (4.20) 

For an example of the generalized Metropolis Algorithm see Algorithm 1 . 

Algorithm 1 The Generalized Metropolis Algorithm. In order to accept a move with 
probability q, where q G [0,1], choose a random number ( from the uniform distribution 
[0, 1]. If £ < g, accept the move, otherwise the move is rejected. Note that in the Metropolis 
method, if a proposed move is rejected, the configuration returns to its previous state. This 
algorithm can be performed with a single configuration (walker) or an ensemble of walkers, 
for all Configurations R; advance to Rj+i do 

Move from R; to R'i with a probability VV(Xj <— X ; ) 

Evaluate the acceptance ratio 

A(x' ^_ X-\ - min h W ( x i- X P^( R, ') ~ 

"*H^i ^~ A V — 111111 W(X^Xi)P(R») 

Compare .A(X. <— Xi) to a random variable £ G [0, 1). 
if C < *<4 then 

Accept the move, R'j is the new configuration: 

R-i+i = R'j 

else 

Reject the move, stay at R; : 

R«+i = Rj 

end if 
end for 
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4.5 Statistics in Monte Carlo 



The material in this section closely follows Ref. [142]. The definitions of the mean and 
variance are, 



(A) = lim 

M^oo 



1 M 

i=l 



and 



v 2 A = ({A-{A)f)., 



(4.21] 



(4.22) 



respectively. It is convenient to evaluate the variance as 



< = (A 2 ) - (A)'' 



(4.23) 



which can be shown to be equivalent to Eq. (4.22). 

In practice Monte Carlo simulations produce a finite number of data points {Ai : i = 
1, . . . , M} (usually printed to an output file), which requires the mean and variance to be 
evaluated by the approximate relations 



M 
i=l 



(4.24) 



and 



1 M 

i=i 



(4.25) 



The error of the mean follows from Eq. (4.8) 



a 




(4.26) 



It is always necessary to equilibrate the data before doing any statistical analysis. Equi- 
libration entails removing the first block of data points - this is necessary because it takes 
several Monte Carlo steps before equilibrium (detailed balance) is reached. 
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Eq. (4.26) assumes that successive values of the random quantity Ai are statistically 
independent. In reality successive values of are printed to file with sufficient frequency 
such that they are highly correlated with each other. At this point it is necessary to define 
the correlation function 



which measures the correlation between points k steps apart. The function c A {k) has the 
property c A (0) = 1 and decays exponentially with respect to k; there exists a k such that 
CA(k) ~ for all k > k$. The auto correlation length is defined as 

fco 

k a = l + 2^c A {k). (4.28) 
k=i 

The correct expression for the error of the mean is 



this is known as the error bar. It is always necessary to include the error bar for any Monte 
Carlo result. The error bar can written in two ways: for a Monte Carlo simulation with a 
mean of 0.1267 and an error bar of 0.023 the result can be written as 0.1267 ± 0.0023 or 
more commonly 0.1267(23). 

4.5.1 Data blocking 

Data blocking [142, 7] is a method where successive values of Ai are grouped into blocks 
defined by a block length: 



c A (k) 



EtLi k (A t -(A))(A +k -(A)) 
EUA-(A)r 



(4.27) 




(4.29) 




(4.30) 
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The value (A)j is a block average over all the data between (j — l)m+ 1 and jm, where m is 
the block size. In the case where the block size m is greater than the autocorrelation length, 
m > ko, the quantities (A)j and (A)j+i are statistically uncorrelated. In practice it is not 
necessary to choose m > fco since this may result in large blocks sizes and consequentially 
reduce the amount of data available from the Monte Carlo simulation for statistical analysis. 

Data blocking can either be performed directly by the Monte Carlo program itself, where 
only the block averages are printed to the output file, or by a data analysis program. Much 
of the statistical analysis performed for the work in this thesis is from DataSpork [143]. 
This is a java based program with a graphical interface that allows the user to define the 
equilibration length and perform data blocking. 

Figure 4.1 shows the results for the local energy of the Li2 molecule from a variational 
Monte Carlo (VMC) simulation. The VMC method is discussed in detail in Section 4.7. 
Figure 4.1(a) plots the local energy with respect to the number of Monte Carlo steps while 
Figure 4.1(b) plots the autocorrelation function of the data. 

4.6 Introduction to Quantum Monte Carlo 

Quantum Monte Carlo (QMC) [144, 145, 146, 147, 148, 149, 150, 151, 152, 30, 153, 8, 7, 31, 
154] is a many-body method which has been applied to a wide variety of strongly interacting 
electronic systems. The two QMC techniques discussed in this thesis are variational Monte 
Carlo (VMC) and the projection based diffusion Monte Carlo (DMC) in the fixed-node 
approximation. Both of these methods are variational and are only applicable to quantum 
systems at zero-temperature. VMC and DMC have been able to attain chemical accuracy 
(about 0.04 eV per molecule) in small systems. In addition, the computational effort of 
QMC techniques scales with respect to the number of electrons, N, as approximately N 3 , 
which is advantageous over other many-body methods. 
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Steps 

(a) The trace of the local energy. 




(b) The autocorrelation of the local energy. 



Figure 4.1: 4.1(a) The results of a VMC simulation for the local energy of an Li2 dimer. 
The local energy is computed as an integral over 3N— dimensions, where N = 6 is the 
number of electrons. The vertical bar denotes the equilibration length; all averages are 
computed after this cutoff. 4.1(b) The autocorrelation function of the local energy. The 
autocorrelation length is approximately equal to 4.0; choosing a block size of this length 
reduces the correlation between successive values of the local energy to zero. 
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4.7 Variational Monte Carlo 



The variational Monte Carlo (VMC) method is based on the combination of the variational 
theorem from quantum mechanics and the Monte Carlo evaluation of integrals. The integral 
under consideration is the expectation value of the energy for a trial wave function 

(*t\H\* t ) /rfRgfr(R)ffgr r (R) , . 

[ Tl (tt T |tt T ) fdR**(R)* T (R) ' 1 ' 

where R is a 3iV-dimensional configuration vector representing the coordinates of all the N- 
electrons and \Pr(R) is a many-body trial wave function that explicitly includes correlation. 
This thesis is strictly limited to real wave functions; therefore \Pt = ^t- This integral can 
be written in a form more convenient for Monte Carlo 

= JffiWW (432) 

where the local energy -Ex(R) is defined as 

E L (R) = *^ 1 (R)/7* T (R). (4.33) 



The Metropolis algorithm is used to generate a set of configurations with the probability 
distribution V(R) = *r(R)/ / dR*|(R) : the energy E^t] is estimated by averaging the 
local energy over this set of configurations: 

1 M 

(4.34) 

i=i 

where equality holds in the limit M — > oo. The variational energy provides an upper bound 
to the exact ground-state energy 

£o < £[*t], (4.35) 
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where Eq is the exact ground-state energy. 

The specific form of the trial wave function is of the Slater- Jastrow form, and is discussed 
in detail in Chapter 6. The local energy is constant if the trial wave function ty? is an exact 
eigenfunction of the Hamiltonian. If ty? = where is an eigenfunction of H with the 
eigenvalue such that H^i = E^i, it can easily be shown that El = Ei. Consequentially, 
the variance of the integrand in Eq. (4.32) goes to zero. This is known as the zero variance 
principle for the energy estimator, and applies to the ground-state as well as exited states. 

The expectation value of any quantum mechanical operator, A, diagonal in position 
space, can be calculated using a similar expression to Eq. (4.32) 

<•*> = T^m • (436) 

where .A(R) = \1/t(R)~ 1 ^ , I / t(R)- The variance of the estimator is given by 

2 frfR^(R)l4(R)-(.4)] 2 

^ = — jdR^m — • (4 - 37) 

In simple VMC the trial moves are sampled from a Gaussian centered on the current 
position of the walker. The size of the move determines the acceptance ratio. The more 
widely used importance- sampled VMC is discussed in Section 4.10.1. For our importance- 
sampled VMC algorithm see Section 5.2. 

Table 4.1 features test results for VMC, where VMC total energies are compared to 
Hartree-Fock (HF) total energies. The VMC trial wave function is identical to the resulting 
HF wave function from the SCF calculation. Since the Hartree-Fock approximation contains 
no correlation, the VMC total energies should agree with the Hartree-Fock total energies to 
within an error bar, which is confirmed by Table 4.1. We use both an all-electron move and 
a particle-by-particle move, see Section 5.2 for a more detailed discussion. 
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Table 4.1: Test results for VMC featuring all-electron move unless otherwise specified. The 
trial wave function for VMC is identical to the HF wave function and includes no Jastrow 
correlation factor. All HF wave functions and energies from Ref. [5] unless otherwise speci- 
fied. All units in a.u. 
' Particle-by-particle move. 
* HF results from Ref. [6]. 





HF 


VMC 


time-step 


Accept (%) 


Be^S) 
N( 4 S) 
N( 4 S)t 
0( 3 P) 


-14.57302 


-14.57288(31) 


0.05 


59 


-54.40092 
-54.40092 
-74.80937 


-54.40083(34) 
-54.40099(24) 
-74.80892(60) 


0.015 

0.02 

0.0125 


58 
85 
53 


H 2 0* 


-75.9693 


-75.9691(14) 


0.01 
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4.8 Umbrella or Correlated Sampling 

From Section 4.7, the expectation value of the energy of a trial function ty? can be computed 
by averaging the local energy El = ^ H^t over a set of configurations drawn from the 
distribution However, under certain circumstances it may be more convenient to sample 
the distribution where \&g is a guiding function, rather than By re-weighting it is 
possible to obtain the average with respect to the original distribution : 

where E L (R) = ^ 7 ; 1 (R)if^ T (R) and the weights 

»(H) = HI (4-39, 

convert the distribution ^> 2 G to Some care must be taken in the choice of to insure 
that the weights are stable. Ideally should either be non-zero over the entire domain, or 
have a nodal surface close to the nodal surface of ^t- If is computationally less intensive 
to evaluate than *&t, Eq. (4.38) can represent an improvement in the efficiency of VMC. 
The efficiency is the error bar achieved in a given amount of computation time. Mathe- 
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matically this is given by the expression 



C = -5^—, (4.40) 

where is the error of the mean and Tcpu is the computation time. Typically it is more 
interesting to compare the efficiency between two calculations 

C[2] *2[1}Tcpu[IY 1 ' ] 

So it is possible to improve the relative efficiency by either reducing cr M or T C pu or both. 
The error of the mean is determined by the accuracy of the trial wave function, i.e. the 
inherent variance, and the number of Monte Carlo steps (see Eq. (4.29)). The computation 
time is determined by the complexity of the trial wave function. 

Define the wave functions and ^2 and their corresponding local energies, E^/ = 
ty^^^H^i and E^ = ^ 1 H^ 2 i respectively. It is possible to calculate the energy difference 
(AE) = (E^)^ - {Ef ] )^ G directly by application of Eq. (4.38): 

(AE) = Hm E^ = i^ (1) (R m )4 1) (R m ) _ E^ = i^ (2) (R m )4 2) (R m ) u 42) 
M -°° E™=i^ (1) (iW Ei=i^ (2) (R m .) 

where ^>o is the guiding function. Since the energies E L (R m ) and E\ (R m ) are computed 
over the same set of configurations (R4, . . . , Rm)> the energy difference in Eq. (4.42) will 
have a lower variance than if the two energies were computed separately. This method of 
computing energy differences is known as correlated sampling [155, 7]. 
For the guiding function given by the expression [156, 157] 

N 

v^(R) = ]T^(R) (4.43) 
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the weights are 




-i 



■w 




(4.44) 



The set {^i} can represent perturbations from an initial wave function \l/o [7]. Examples 
of such perturbations can be external electric fields or changes in the geometry of the ions. 
Another possibility is that the {^i} can represent the ground and excited states of a system. 
The only restriction on Eq. (4.43) is that all the {^i} are of the same spin-configuration: 
the dimensions of the determinants for the spin-up and spin-down electrons are all identical. 
The weights, Eq. (4.44), can be used to evaluate the ratio 



where V is the 3N— dimensional gradient operator. Using ^/q from Eq. (4.43) in the corre- 
lated sampling framework can represent a significant improvement in efficiency if the wave 
functions are highly correlated. Regardless of the statistical correlation, it is faster to simul- 
taneously calculate several excited states rather than performing independent calculations 



4.9 Introduction to Projection Based Methods 

While VMC is an excellent method for investigating properties of trial wave functions and 
is a systematic improvement over single-body methods such as HF and LDA, the ultimate 




(4.45) 



The gradient is determined by the relation 



(4.46) 



[51]. 
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Table 4.2: The ground-state energy (E GS ), excited state energy (E EX ) and the energy 
difference (AE = \E GS — E |) for GeH 4 and Ge2H 6 from independent VMC simulations 
compared to the correlated sampling results (VMC-CS). Results with the CPP are also 
included. All energies in a.u. 







H/ 


Hi 


A P 


No rpp 






O. 1 OUOjI O 1 ! 1 


U.OOUiO IIO 1 




VMC-CS 


-6.17250(22) 


-5.78667(34) 


0.38583(36) 


GeH 4 










CPP 


VMC 


-6.23251(27) 


-5.85524(33) 


0.37727(43) 




VMC-CS 


-6.23231(35) 


-5.85508(35) 


0.37722(35) 


No CPP 


VMC 


-11.18961(42) 


-10.88881(42) 


0.30080(59) 




VMC-CS 


-11.18915(48) 


-10.88852(62) 


0.30063(58) 


Ge 2 H 6 










CPP 


VMC 


-11.30732(45) 


-11.01420(49) 


0.29312(67) 




VMC-CS 


-11.30782(38) 


-11.01502(46) 


0.29280(44) 



accuracy of the algorithm is limited by the quality of the trial wave function. 

Projection based techniques are capable of obtaining the true ground-state properties for 
the iV— electron system. This is possible by using the imaginary-time propagator e~ T ^ H ~ ET \ 
where r is an interval in imaginary time, H is the TV— electron Hamiltonian and E? is an 
adjustable trial energy. Applying the imaginary-time propagator to a trial wave function 
which has been expanded in terms of the exact eigenfunctions of the Hamiltonian H, 
leads to 

e -r(H-E T )^ T = ^$ ie -^-^r)($.|\Er T ) 7 (4.47) 

i=0 

where Ei is the eigenvalue of the eigenfunction $j. In the large r limit 



lim e - T{H - ET) ^ T = <S>oe- T{Eo - ET) (<S>o\y T ), (4.48) 



the 2 = term dominates if E? = E ; the terms in the expansion involving the excited states 
are exponentially damped. 1 

The only projection based technique discussed in this thesis is diffusion Monte Carlo 



It is assumed the there is a non-zero overlap between the trial wave function and the ground-state $o- 



75 



(DMC). Examples of other projection based methods are various Green's Function Monte 
Carlo (GFMC) techniques [146, 158, 151, 159, 7] and Reptation Monte Carlo (RMC) [160, 
161]. 



4.10 Diffusion Monte Carlo 

The diffusion Monte Carlo (DMC) method can be derived from the imaginary-time Schrodinger 
equation 

d$(R, t) = ^ _ £ r)$(Rj ^ (4 49) 



dt 

where E? is an energy shift and R = (r 1; r 2 , . . . , r^) is a 3iV-dimensional configuration 
vector. The solution to Eq. (4.49) is given by 

|$(t 2 )) = e-^-^-^mh)), (4.50) 

which can be confirmed by insertion of Eq. (4.50) into Eq. (4.49) and differentiating with 
respect to ti- Let r = t<i — t\ be an interval in imaginary time and t = t\\ Eq. (4.50) can be 
rewritten as 

mt + r)) = e~ T{H ~ ET) mt)}. (4.51) 

Eq. (4.51) can be written in integral form by the insertion of the unity operator in the 
position space representation (the complete set of position states 1 = J dR'|R')(R'|) between 
exp[— t(H — Et)] and and multiplying on the left by the position state (R| : 



$(R,t + r) = J dR'G(R<— R';r)$(R',t). 



(4.52) 



The term G(R R'; r) is defined as the Green's function for the Hamiltonian H and has 
the form 

G(R <- R; r) = (R\e- T{H - ET) \R'}. (4.53) 
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The Green's function obeys the same imaginary-time Schrodinger equation, Eq. (4.49), as 
the state $(R,i) : 

—d T G(R <- R; r) = (H(R) - E T )G(R <- R; r), (4.54) 
with the initial condition 

G(R<-R';0) = 5(R-R'). (4.55) 

The Green's function in Eq. (4.53) cannot be evaluated directly, otherwise there would 
be no need for DMC, but can be approximated for short time intervals. This is possible by 
application of the Trotter-Suzuki formula for the exponential of a sum of non-commuting 
operators A and B 

e -Ar(A+B) = e -ArB/2 e -ArA e -ArB/2 + ( 4 . 56 ) 

where Ar is a short time interval. The Hamiltonian (H = T + V) is the sum of the N- 
electron kinetic energy operator T and a potential V, which consists of electron-electron, 
ion-electron and ion-ion terms. The short time approximation to the Green's function 

G(R <- R; T) W e -Ar[V(R)-^]/2 (R | e -Arf | R / )e -Ar[V(R0-^]/2 ; ^ 5J) 

from Appendix A. 6 it can be shown that Eq. (4.57) yields 

G(R <- R; Ar) w (2^Ar)^ 2 e -( R " R, ^ 2AT e- A ^^ R ) +,/ ( R ')- 2 ^/ 2 . (4.58) 

Projection of the ground-state from the initial distribution requires the long time limit 
t — > oo, but the Green's function can only be evaluated for small time intervals. The question 
arises as to how to reconcile these conditions. The answer is that the Green's function can 
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be factored into many short time intervals and iterated 

N N 

e-* H = \{[{e- H ) l/N }=\[[e-^ H ] v (4.59) 



where At = r/N is a short time interval. 

To illustrate the symmetry between the imaginary-time Schrodinger equation and the 
diffusion equation, neglect the potential V — Et term in the Hamiltonian. The imaginary- 
time Schrodinger equation, Eq. (4.49), is a 3iV-dimensional diffusion equation 

^M = A£v?*(R,t)„ (4.60) 

i 

where A = 0.5 is the diffusion constant. The function $(R, t) can be interpreted as the 
probability distribution of walkers diffusing in time and space. The normalized Green's 
function for the diffusion equation 

G Diffusi(m (& - R'J t) = (47rAtr 3AV2 e-( R - R ') 2 / 4A *, (4.61) 

is identical to the first term in Eq. (4.58). Eq. (4.60) can be solved stochastically: an 
initial set of walkers from the distribution $(i = 0) is propagated in time using the Green's 
function, Eq. (4.61), transition probability. 

The remaining part of the Green's function in Eq. 4.58 is known as the branching factor 

e -Ar[V(K) + V(R')-2E T ]/2 = G ^ R ^ R /. Ar) = p (4 _q 2) 

This can be interpreted as a time-dependant re-normalization or re- weighting for the diffusion 
process. In pure DMC the weight of each walker is multiplied the by the branching term 
after each diffusion step. This method is not efficient since the accumulated product of the 
weights varies greatly from walker to walker. Usually one walker eventually accumulates all 
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the weight. 

An alternative approach to accumulating weights is the birth/death algorithm, where P 
is treated as a probability for walkers to advance to the next iteration. If P < 1, the walker 
continues to the next iteration at R with the probability P. If P > 1, INT[P] walkers 
continue at R and a new walker is created at R with the probability determined by the 
remainder P — INT[P], where INT denotes the integer part of a real number. These two 
cases can be implemented by the command 

N copies = INT[P + V ], (4.63) 

where rj is a random number such that < rj < 1, and N copies is the number of walkers 
continuing to the next iteration at position R. If N cop i es = 0, then the walker dies. 

The diffusion Monte Carlo algorithm described above can be inefficient. The branching 
process described by Eq. (4.63) can lead to large fluctuations where the potential V diverges. 
This is turn can lead to large statistical fluctuations in expectation values. The importance 
sampling transformation is a way to reduce these fluctuations [151]. Importance sampling 
requires the introduction of a trial or guiding function The form of the trial function is 
discussed in more detail in Chapter 6; for now it is assumed that \I/t is a good approximation 
to the ground-state $o- The trial function is used to bias the stochastic process to produce the 
distribution \I/t$o i n the long time limit, as opposed to $o in ordinary DMC. Section 4.10.4 
discusses how estimators are evaluated with respect to the mixed distribution \I/t$o- The 
importance sampling transformation also plays a crucial role in the fixed node approximation, 
see Section 4.11.2. 
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4.10.1 The Importance Sampling Transformation 

The importance sampling transformation [147] is to multiply Eq. (4.49) by the trial function 
^t- Define the importance function / = ^t&, and rearrange the terms to arrive at: 

df(R,t) 1 2 



-VV(R, t) + V ■ [V D (R)/(R, t)} + [El (TV) - E T ] /(R, t), (4.64) 



dt 2 

where V = (Vi, V 2 , . . . , Vjv) is the 3iV-dimensional gradient and E L = ^>~}H^! T is the 
previously defined local energy. The drift- velocity vector Vd, is related to the gradient of 
of the trial function: 

V D (R) = Vln|tf T (R)| = ^ T (R)- 1 V^ T (R). (4.65) 

The effect of Vd is to increase the density of walkers where the magnitude of the trial 
function is large. 

The integral equation corresponding to the modified Schrodinger equation in Eq. (4.64) 

is 

/(R, t + t) = J dR! K(R «- R'; r)/(R', t). (4.66) 
The function K(R <— R'; r) is the modified Green's function and satisfies the relation 

K(R «- R; r) = * T (R)G(R «- R'; rJ^^R'), (4.67) 

where G(R <— R'; r) is the Green's function defined in Eq. (4.53). The short-time approxi- 
mation for the modified Green's function can be represented as the product of a drift-diffusion 
and a branching process 

K(R <- R; At) w G d (R «- R; Ar)G B (R <- R; Ar), (4.68) 
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see Appendix A. 6. The diffusion part is given by 



G D (R <- R'; At) = (27iAry 3N / 2 e~ [R ~ 




(4.69) 



where the drift is defined as (AtVd(R')) and Vd(R) is constant over the short time interval 
At. The branching term is given by 



An immediate observation is that branching term now contains the local-energy El 
instead of the potential V. This is advantageous because for a good trial wave function the 
local energy surface El(R) is a smooth function, in the limit \Pt = $o the local energy is 
constant El = E and has zero variance. This is the zero variance principle discussed in 
Section 4.7. Hence branching with El has the effect of reducing the population fluctuations 
and improving the efficiency of DMC. 

The energy shift E? is chosen to keep the weights close to unity, E? ~ Eq. It is necessary 
to periodically adjust the value of Et to keep the average population of walkers roughly 
constant. A constant Et can lead to a population explosion (or implosion) for Et > E (or 
Et < Eq). In qmcPACK [3], Et is adjusted each iteration: the value of Et for the next 
iteration (i + 1) is given by the expression 



The term E est (i) = (El) (i) is the current value of the mixed estimator of the local energy (see 
Eq. (4.89)), while the second term attempts reset the current population P(i) approximately 
no iterations (generations) later the to a target population P . Typical values of uq are on the 
order of 1/ At. The feedback of the current number of walkers into Et introduces a systematic 
bias, referred to as the population control bias. For a more detailed discussion of this bias 



G B (R<-R';r) =e 



Ar[E i (R)+_B i (R')-2£; T ]/2 



(4.70) 




(4.71) 
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and how to remove its effect on estimators see Ref . [8] . The bias can generally be suppressed 
by using a large number of walkers. Figure 4.2(a) compares the mixed estimator of the local 
energy to Et for the SisH^ nanocluster; Figure 4.2(b) plots the walker population. 



4.10.2 Sampling the Drifted Gaussian 

Ignoring the branching term [El(R) — Et] , Eq. (4.64) can be written as 



0_ 

Of 



/(R, t) = \ ^ V 4 • [V, - Fi(R)] /(R, *). (4.72) 



where F» = 2Vj\l / T/^r is related to the ith component of the drift- velocity. This is a 
Fokker-Planck equation [162] for isotropic diffusion in 3iV-dimensions, with the a steady 
state solution of / = ^ T . 

Fokker-Planck trajectories are generated by a Langevin equation. The Langevin equation 
for Eq. (4.72) is 

^ = AF[R(i)]+^, (4.73) 

where v is a Gaussian random variable of mean zero and variance 2A. A discrete form to 
Eq. (4.73) can be derived by integrating over short time interval At : 

R' = R + AF[R(t = 0)]At + x, (4.74) 



where F is constant over the time interval, and x is a Gaussian of mean zero and variance 
2XAt. The discrete form of the Langevin equation in Eq. (4.74) prescribes a move for a walker 
from R to R' and is useful for Monte Carlo, but is an approximation to the continuous form 
in Eq. (4.73). The result of using Eq. (4.74) is the introduction of a bias in the trajectories 
for all time steps At > 0. The bias increases with the step size and can be corrected by the 
introduction of a Metropolis accept/reject step. 
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The Green's function for the Fokker-Planck equation 



K FP (TL^R';t) = (R|e A H v '- v -*-*- v J|R'), (4.75) 

obeys an identical relation to Eq. (4.72) and has the boundary condition K(R <— R';t = 
0) = 5(R — R'). By assuming F is constant over a short time interval At (between R' and 
R) it is possible to obtain the relation 

K FP (R <- R; At) « ( 47rA At)- 3iV / 2 e-( R - R '- AF ( R '« 2 / 4AAt . (4.76) 

This is the drift-diffusion part of the Green's function in Eq. (4.69). The move prescribed 
by Eq. (4.74) is consistent with Eq. (4.76); a drifting Gaussian with variance XAt and drift 
2AV(ln tf T ). 

Going back to the generalized Metropolis method, the move prescribed by Eqs. (4.74, 
4.76) can be treated as a trial-step probability and is accepted with the probability 

A(R <- R) = min[l, Q(R <- R)], (4.77) 



where 

„ N _ K FP (B! <- R; At)^ T (W 2 
i^irp(R <— R'; At)\E'|(R 



q(r - r') = ^z,^: v ^ 



4.10.3 The Metropolis Accept/Reject Step 

The exact importance-sampled Green's function for the modified Schrodinger equation, 
Eq. (4.64), satisfies the condition 

K(R <- R; Ar)^(R') = K(R! <- R; Ar)^(R), (4.79) 
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which can be used to impose detailed balance. 2 The Green's function Grj(R <— R'; Ar) ap- 
proximately satisfies an identical relation to Eq. (4.79), this can be shown by the substitution 
K(R «- R'; Ar) « G d (R <- R'; Ar)G B (R «- R; Ar) and the symmetry of the branching 
term Gb(R «— R'; Ar) = Gb(R' <— R; Ar). For the trial move from R' to R prescribed by 
G_d(R <— R'; Ar), detailed balance is imposed by accepting the move with the probability 

A(R<-R') = miii[l,Q(R<-R')], (4.80) 

where 

cm : W) ^(R^R;Ar)v& r (R) 2 , , 

Q(R<-R)- GD(R _ R , ;Ar) *2 (Rr ( 4 - 81 ) 

This step is only necessary because a short time approximation is used for the Green's 
function K. As Ar — > 0, Q — > 1 and the approximate Green's function more nearly satisfies 
the detailed balance condition, Eq. (4.79). The accept/reject step in Eq. (4.80) ensures 
that the proper distribution is sampled for any Ar. In practice Ar is chosen such that the 
acceptance ratio is greater than 99% : rejecting too many moves has been found to bias the 
unknown distribution /(R, r) [30, 31]. If branching is ignored than the steady state solution 
is / = |^ T | 2 , and DMC reduces to VMC. 

The Metropolis accept/reject step, Eq. (4.80), allows any transition probability pre- 
scribed by G_d(R <— R'; Ar), provided that the reverse probability is finite [154]. A finite 
cutoff for the drift velocity Vd is introduced to prevent divergences which can occur at 
singularities in the potential or at the nodal surface of ^>t (for a discussion of the nodal 
surface see Section 4.11.1). The smooth cutoff proposed by Umrigar, Nightingale and Runge 
(UNR) [8] 

-1 + JI + ZV^At 
V ° = V*Ar V ° (482) 

is implemented in qmcPACK [3] . The modified drift velocity reduces to usual drift veloc- 



2 Eq. (4.79) can be verified by the definition of K(R <— R'; At) in Eq. (4.67) and the symmetry of 
G(R «— R'; At) upon the interchange of R and R'. 
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ity Vd for small V^Ar, however the magnitude of Vd is limited to a/2/At for large V^At. 
The presence of a large drift velocity can lead to occurrence of persistent configurations. 
These are defined as configurations which are stuck in the same position in configuration 
space and can have a catastrophic effect on estimators such as the local energy. Another 
method that is used to prevent persistent configurations involves assigning an age to each 
walker. If a walker becomes old - it has not moved in several iterations - its weight is reduced, 
increasing the likely-hood that it is killed in the branching procedure. 

The actual time the electrons drift and diffuse is less than At, due to the Metropolis 
rejection step in Eq. (4.80). For this reason some DMC algorithms, following Refs. [8, 30], 
define an effective time step. The effective time step Ar e ff is given by the ratio 



Ar e// = At 



(AR 2 ) 



accepted 



(AR 2 ) 



total 



(4.83) 



where AR are the displacements resulting from the diffusion part of the Green's function 
and the angular brackets denote an average over all the moves. The effective time step is 
used in the branching part of the Green's function Gb- 

4.10.4 The Evaluation of Estimators 

The computation of ground-state expectation values in DMC is complicated by the fact 
that in the long time limit DMC yields a set of walkers with the probability distribution 
/ = ^t^Oi where $o is the many-body ground-state. At first sight this might be seen 
as being problematic because ground-state expectations are computed with respect to the 
ground-state distribution |$o| 2 - 

The mixed estimator is a method to compute the expectation value of an observable with 
respect to the mixed distribution / : 

{A)mW ~ <$ |*r> ' ( } 
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which in integral form can be written as 

/rfR$o(RM(R)vMR) 
{A)DMC = fdR$ (R)* T (R) ' (485) 

it is assumed that the operator A is diagonal in position space. For any operator that 
commutes with the Hamiltonian H, it can be shown that the mixed estimator is equal to 
the true quantum mechanical estimator: 

<"W = 7*^r = W (486) 

The proof of Eq. (4.86) follows from the fact that <3>o can be written as a projected state 
($ = limr-oo e- rH V T ) ■ 

/dR* T (R)e^$ T (R) 

\A)dmc = hm 



/ dR*r(R.)e- TH *r(R) 

f dR $ T (R)e" T/f/2 ie- Tff /^ T (R) 
lim — — - 

r-oo JrfR* T (R)e-^/ 2 e-^/ 2 * T (R) 

/ rfR$o(R)^$ (R) 
J riR$ (R)$ (R) ' 



(4.87) 



The commutation of A and H was necessary to move A between the two halves of the 
projector [154]. 

For the case A = H it follows 



/ dR$o(R)^r(R) / dR$o(R)^T(R) 

/ d~R /(R)-Ex(R) 



J dR%(R)H^ T (R) J rfR$ (R)*T(R) [^(RJtf^MR) 

(4.{ 



/dR/(R) 



i=i 

Hence the estimator for the local energy (El) is a measure of the exact ground-state energy 
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Table 4.3: The HF, VMC and DMC total energy results for several atoms and molecules 
compared to experiment. Atomic orbitals from Ref. [5]. All experimental (non-relativistic) 
results quoted from Ref. [7] and references therein. All energies in a.u. 
a bond length 1.401 Bohr. 





HF 


VMC 


DMC 


Expt. Ref [7] 


He( 1 5) 
Li( 2 5) 
Be( 1 5) 
C( 3 P) 


-2.86168 
-7.43273 
-14.5730 
-37.6886 


-2.88832(84) 
-7.46046(29) 
-14.5733(22) 
-37.6908(24) 


-2.90384(23) 
-7.47663(93) 
-14.6655(21) 
-37.8318(25) 


-2.9037 
-7.4781 
-14.6673 
-37.8451 


(H 2 ) a 




-1.157934(55) 


-1.174383(89) 


-1.17447 



Eq. 

Expectation values of operators that do not commute with H can be computed by the 
extrapolated estimator 

{A) EXT — 2(A) DMC ~ 

(A)vmc + O[(V t -<5> ) 2 ], (4.89) 

the VMC estimator corrects the mixed estimator from DMC to extrapolate the ground-state 
estimate. The accuracy of the extrapolated estimator depends on the quality of the trial 
wave function, i.e. how close ^>t is to the true ground-state $o- Examples of operators that 
do not commute with H are correlation functions, structure factors and probability densities. 

An alternative method to evaluate the estimators of operators that do not commute with 
H is by the future or forward walking technique, discussed in Ref. [163]. This method is 
computationally more intensive than evaluating the extrapolated estimator, and has been 
criticized for being numerically unstable [164]. The forward walking technique is currently 
not implemented in qmcPACK [3]. 
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Table 4.4: Comparison of QMC results from qmcPACK [3] to the published results of Um- 
rigar, Nightingale and Runge (UNR) [8]. For the trial wave function of Ne, see Table V 
in Ref. [8]; for the trial wave function of Li 2 , see Table VI in Ref. [8]. The Li 2 dimer is 
oriented along the z-axis. The electron-electron Jastrow correlation is the Pade form and is 





qmcPACK 
VMC DMC 


UNR 
VMC DMC 


Ne 


-128.71457(85) -128.928(3) 


-128.713(2) -128.922(4) 


Li 2 


-14.94726(27) -14.99063(69) 


-14.9472(2) -14.9890(2) 



4.11 Diffusion Monte Carlo for Fermion Systems 
4.11.1 The Fermion Sign Problem 

The discussion of the DMC method has implicitly assumed that the distribution / = 
is positive over the full domain, such that it can be interpreted as a probability distribution. 
However, for a systems of interacting fermions this assumption is no longer valid. The 
particle exchange antisymmetry condition, Eq. (2.14), implies that there must be positive 
and negative regions of the wave function and hence the Green's function. 3 

One way to deal with the fermion-sign problem is to assign walkers positive or negative 
signs, but such methods have proven to be unstable [31]. In the widely used fixed-node 
approximation [148, 149, 152, 30], the unknown nodal surface of $0 is chosen to be identical 
to the nodal surface prescribed by a trial wave function such that / is non-negative over 
the full domain [30]. The nodal surface of which is a function of 3N— dimensions, is a 
(3iV — 1) hyper-surface on which ty? is zero and across which it changes sign. The fixed-node 
DMC (FN-DMC) algorithm succeeds to produce the lowest many-electron state consistent 
with the nodal surface of \&t- Before examining the fixed-node approximation in more detail 
it is useful to discuss the more general fixed-phase approximation [165]. 



3 It should be noted that fermion statistics do not represent a complication for VMC since \^t\ 2 is always 
positive. 
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4.11.2 The Fixed-Phase and Fixed-Node Approximations 

The fixed-phase approximation [165] is applicable to systems that do not obey time-reversal 
symmetry. The Hamiltonian for a system of N interacting fermions in an external magnetic 
field B = V x A (in atomic units) 



H = - Yl H V < + « A ( r -')] 2 + V{ ?^ r 2> • • • > r ^v) 



(4.90) 



is an example of a Hamiltonian that is not time-reversal invariant. The eigenfunctions of 
this Hamiltonian are complex. A generic state $(R) can be written in terms of a modulus 
|$(R)| and a phase v?(R) : 

$(R) = |$(R)|exp[^(R)], (4.91) 

where |$(R)| and (p(R) are real functions. The real and imaginary parts of the Schrodinger 
equation yield, respectively, a pair of coupled differential equations for |$(R)| and (p(R) : 



|$(R)| = E\<$>(R) 



(4.92) 



and 



£ V, • {|$(R)| 2 [V^(R) + «A(r,)]} = 0, 

i 

where the effective potential Vfp is given by 



(4.93) 



v fp (r) = ±J2 ( v ^( r ) + aA i 



(4.94) 



Eq. (4.92) for |$(R)| is a boson problem, since |$(R)| > 0, but it is coupled to Eq. (4.93). 
The fixed-phase approximation consists of choosing an appropriate trial function \&y(R), 
with the symmetry of the desired many-body ground-state $o(R)> an d fixing the unknown 
phase of $o(R) to be identical to the phase of the trial function, <^y(R) = S(log ^r(R)). For 
the phase determined by (^r(R), the boson problem for |$(R)| is solved exactly by DMC. 
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The fixed-node approximation is a special case of the fixed-phase approximation, and 
is applicable when time-reversal symmetry is obeyed, i.e. A = 0. The phase of the state 
can either be or ir, corresponding to regions where the wave function is either positive or 
negative, respectively. The phase changes from the values to 7r discontinuously at the nodes 
of the trial function: <p(R.) = ir[l — 0($t(R-)/| < ^(R)|)]) where 9 is the unit step (Heaviside) 
function. The corresponding fixed-node potential 

V FP (R) = V FN (R) =7r 2 (V^ r (R))V(^ T (R)), (4.95) 

has the effect of not allowing paths in configuration space that cross the nodal surface. 
Multiplying Eq. (4.92) by the trial function ty? when A = yields an identical relation to 
Eq. (4.64): the modified Schrodinger equation for the state /(R, r) = \1/t(R)3 ) o(R) r )- f 
can now be interpreted as a probability distribution since it is positive. 

The implementation of the fixed node method within the importance sampling DMC 
algorithm (see Section 5.3) is straightforward. Any proposed move which crosses the nodal 
surface is rejected: K(R <— R'; r) = if R and R' are in different nodal pockets. Eq. (4.92) 
is solved for each nodally bound volume separately; each solution can be viewed as the Bose 
ground-state for that volume with hard wall boundary conditions on the enclosing nodal 
surface. The only communication between the different volumes is by the trial energy Et-i 
see Eq. (4.71), which is adjusted to maintain an average walker population. It should be 
noted that some algorithms kill walkers that cross a node, but this practice has been found 
to increase the time step error [8, 31, 166]. qmcPACK by default rejects moves which cross 
the nodal surface [3]. 

The result of the stochastic process defined above is the lowest energy many-electron 
state consistent with the nodal surface of The accuracy of the method is determined 
by how closely the nodal surface of ^>t approximates the true ground-state nodal surface. 
The fixed-node DMC energy is always greater than or equal to the true ground-state energy 
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(Efn > Eq), with equality holding in the limiting case when the nodes of \&t are exact. In 
this sense fixed-node DMC is a variational method. 4 

The number of walkers attempting to cross the nodes of vanish as At — > 0. For any 
finite time step At, walkers will rarely attempt to cross the nodes due to the approximate 
nature of the Green's function K. This can be understood by inspection of drift-diffusion 
Green's function, Eq. (4.69). The drift term, which acts to push walkers away from the 
nodes, is proportional to At, while the standard deviation of the Gaussian from diffusion is 
proportional to \J At. Since At is always a small number, At < 1 a.u., the diffusion term can 
occasional overcome the drift, resulting in a node crossing. The failure of the approximate 
form of the Green's function is especially pronounced near the nodes where the local-energy 
and the drift-velocity diverge, and at singularities in the potential (electron-electron and 
electron-ion overlaps) which can result in divergences of the local energy and a discontinuity 
of the drift velocity. 

4.12 Excited States in QMC 

So far almost all of the discussion pertaining to the QMC method has been confined to 

ground-state properties. The trial wave function has been assumed to be an approximation 

to the iV-electron ground-state: =>- |iV; 0) . In general we are interested in electronic 

excitations, which in QMC are calculated as total energy differences. Electronic excitations 

such as electron addition and removals can be calculated as the total energy difference of the 

ground-states of the N and N ±1 systems, i.e. \E^ — En±i\- This is the idea behind ASCF 

methods discussed in Section 2.3.2. For particle conserving excitations such as the optical 

gap, it is necessary to construct the trial wave function for the excited state: \Pt \N;i), 

where the index % denotes the excited state of the N electron system [167, 168, 31, 45]. 

Usually such excited state wave functions are created by promoting an electron to an excited 
4 For a proof see Ref . [30] . 
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state orbital in the Slater determinant part. Such approaches have been used for molecules 
and clusters in Refs. [2, 57, 67, 169, 66]. This approach has also been used in solids to 
calculate band gaps. Previous QMC calculations of band gaps include atomic solid nitrogen 
in the 72x3 structure [170], diamond [171, 168] and silicon [172]. 

The VMC method can be implemented with any trial wave function. The zero-variance 
principle, see Section 4.7, applies to all the eigenfunctions of a given Hamiltonian H, in- 
cluding the ground and excited states. For a trial wave function of a distinct symmetry, 
the variational principle guarantees that the energy is greater than or equal to the lowest 
eigenvalue of that symmetry. 

At first it might seem that DMC cannot be extended to excited states, since the DMC 
algorithm projects the lowest energy state of H in the long time limit. As we have already 
discussed in Section 4.11.2, DMC in the fixed-node approximation converges to the lowest 
energy many-body state consistent with the nodes of a trial wave function ^t- The impli- 
cation is that if the nodal surface of \&t is identical to the nodal surface of an eigenfunction 
of H, DMC will project the exact eigenfunction. Therefore, the fixed-node approximation 
prevents a collapse to the ground-state. If does not have correct nodal surface of the 
desired excited state, the DMC energy may not be above exact energy of that state due to 
mixing of lower energy states. 

It was widely believed that the DMC energy must be greater than or equal to the energy 
of the lowest eigenfunction with the same symmetry as the trial wave function. Foulkes 
et al. [167] showed that this assumption is only justified if the trial wave function trans- 
forms according to a one-dimensional irreducible representation of the symmetry group of 
the Hamiltonian H. If ty? transforms to a mult i- dimensional irreducible representation, cor- 
responding to a degenerate eigenfunction, the DMC energy may lie below the energy of the 
lowest eigenfunction of that symmetry. Despite this, DMC in the FN approximation works 
well, regardless of the dimension of the irreducible representation. 
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4.13 Calculating the Optical Gap in QMC 

QMC is an excellent approach to calculate optical properties since optical excitations are 
inherently many-body in character: including the interaction of the electron-hole pair with 
all of the other electrons. Optical excitations in QMC are determined as the total energy 
difference between the ground-state configuration and an excited state configuration 

E OPT = E EX _ E GS_ (496) 



The disadvantage of this approach is that it becomes difficult for QMC to attain the ab- 
sorption spectrum of the nanoclusters over a large energy range. Our interest is confined to 
the optical gap, which is the lowest dipole allowed singlet transition in the absorption spec- 
trum. All of the QMC calculations are for vertical excitations in the ground-state structure 
of the constituent atoms (see Figure 1.1); structural relaxation of the excited state was not 
undertaken. 

The excited state configuration in Eq. (4.96)is created by removing an electron from the 
highest occupied molecular orbital (HOMO) and promoting it to an excited state molecular 
orbital in the spin-up Slater determinant: 



^N/2^1 



'lpN/2(rN/2) 



'0AT/2+l(ri) ... '(/>7V/ 2 +l(rjV/ 2 ) 



(4.97) 



For the optical transition to be allowed, a dipole matrix element between the HOMO and 
excited molecular orbital must exist. The excited molecular orbital is normally chosen to 
be the lowest unoccupied molecular orbital (LUMO), see Eq. (4.97), unless the HOMO — > 
LUMO transition is optically forbidden. This was found to be the case for some of the 
Ge n H m and Si n H m nanoclusters, such as Ge 5 H 12 . 
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In order to construct the true singlet excited state, it is necessary to add a second set of 
determinants where the down-spin electron is excited: 

(4.98) 

where D denotes a Slater determinant where an electron has been excited. The inclusion of 
the second set of determinants has been found to be a negligible effect to the optical gap 
[57, 2] and is neglected in this work. 
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(b) The FN-DMC population. 

Figure 4.2: The results of a Fixed-Node DMC calculation for the SisH^ cluster. The time- 
step is 0.025 a.u., the target population is 320 walkers and tiq = 100 (see the accompanying 
text). 
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Chapter 5 

QMC Algorithms and QMCPack 



5.1 Introduction 

This chapter presents the algorithms for VMC and FN-DMC used in the qmcPACK quantum 
Monte Carlo package [3]. The development of qmcPACK was one of the most important 
aspects of the work in this thesis. qmcPACK was written in C++ [173] using the Standard 
Template Library (STL) and libraries by Boost [138], HDF5 [174], BLAS and LAPACK. 
For the parallelization we used the MPI package [175]. Previous versions of qmcPACK 
have used the GNU Scientific Library [176], BLITZ++ [177] and SPRNG [139]. All of the 
QMC calculations performed for this work are by qmcPACK unless stated otherwise. In 
addition, we have released the code to the public and are currently working on adding new 
components, improving algorithms and a more complete manual and documentation. 

The VMC and FN-DMC algorithms presented in this chapter are for a particle-by-particle 
move, in which each of the particles (electrons) are moved individually and either Metropolis 
accepted/rejected. It is also possible to use an all-electron move, in which all of the electrons 
of the system are moved simultaneously and the move is Metropolis accepted/rejected. This 
approach is more closely related to the derivations in Chapter 4. qmcPACK features both 
types of moves. The advantage of the particle-by-particle move is that it permits a much 
larger step size to attain a given acceptance ratio. More importantly, for systems containing 
large number of electrons, it is unfeasible to use an all-electron move. 

Caution should always be exercised when performing a QMC simulation featuring a 
particle-by-particle move on a system with the bare (—Z/r) Coulomb potential. By choosing 
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a small enough step size, it is possible to attain a reasonable acceptance ratio, but it is often 
the case that the acceptance rate for moves involving the inner core electrons are much 
smaller than the acceptance rate for moves involving the outer valence electrons. This is due 
to the differences in the energy and length scales between the core and valence electrons. 
Such a disparity between the acceptance rates make any QMC result suspect. 

5.2 Variational Monte Carlo Algorithm with 

Importance Sampling for Particle-by-Particle 
Move 

The algorithm presented in this section is for the importance sampled version of VMC. 
This algorithm is very similar to the Fixed-Node DMC algorithm presented in the next 
section; the main difference is that branching is suppressed in VMC. The time-step does not 
correspond to an interval in imaginary time as in DMC, but is used to control the size of 
the trial move and therefore the acceptance ratio. 

The algorithm incorporates the concept of blocking discussed in Section 4.5.1; the esti- 
mators are printed at the end of each block. It should be restated that it is always necessary 
to equilibrate the data before performing any subsequent data analysis. All the weights in 
VMC are unity, 
for all blocks do 
for all steps do 

for iw G walkers do 

Reset the variables: nAcceptTemp, nRejectTemp, = 
for ip e particles do 

For the particle at position rj£ drift and diffuse: r^^ = rf™' + x + ^(rl™'), 
where x is a gaussian of mean zero and variance r. 
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Accept the move with probability: A(RW -> R /( ™ } ;r) = min[l, Q(RW -> 
R' (i,u) )], where Q is defined in Eq. (4.81). 
if The move is accepted then 

Update the walker properties 

nAcceptTemp + + 
else 

nRejectTemp + + 
end if 
end for 

Update the walker properties 
end for 

nAccept+ = nAcceptTemp, nReject+ = nRejectTemp 
for all walkers do 

Accumulate estimator for observable (A): 

A sum + = WW A« w \ W sum + = WW 
end for 
end for 

Write estimator to file and reset variables: A sum , W sum = 
end for 

5.3 Fixed-Node Diffusion Monte Carlo Algorithm for 
Particle-by-Particle Move 

This section presents the version of the FN-DMC algorithm used in qmcPACK [3]. The 
algorithm follows the derivations in Chapter 4. 

For DMC it always necessary to start the calculation with an initial set of walkers (en- 
semble). These walkers come from a previous VMC calculation and sample the distribution 
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where ^>t is the trial wave function. The FN-DMC algorithm starts from this distrib- 
ution and iteratively projects the mixed distribution $o^r, where <3>o is the lowest energy 
state consistent with the nodal surface of \I/t- Estimators are calculated from this mixed 
distribution as in Section 4.10.4. 

For each walker it is necessary to store a set of properties, in addition to the coordinates 
of the N— electrons: R. The following walker properties are typically stored: 

• The local energy: El(R) 

• The square of the local energy: El(R) 2 

• The drift velocity: Vd(H) 

• The weight: W 

• The multiplicity: M 

• The Age: Age 

The multiplicity is used for branching. In qmcPACK [3] we also store the constituent parts 
of the local energy. 

As already mentioned the DMC algorithm is designed for an ensemble of walkers. Since 
the walker population is changing throughout the run we represent the walker ensemble by 
a linked list: for a discussion of linked list see Ref. [178]. A ceiling and a floor are set for 
the walker population. The ceiling is twice the ideal population while the floor is half the 
ideal population. If the walker population exceeds the ceiling our algorithm kills the extra 
walkers. On the other hand, if the walker population is below the floor extra walkers are 
created. If either of these situations arise it is recommended to re-run DMC with a smaller 
time-step. 

for all blocks do 
for all steps do 
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for iw G walkers do 

accepted ' 



Reset the variables: nAcceptTemp, nRejectTemp, Ai?^, ted , AR^ otal = 



for ip G particles do 

For the particle at position r ™^ drift and diffuse: r'[p = + x + r ^D( r i™ 
where x is a gaussian of mean zero and variance r. 
Ai^Hx + rV^r^)! 2 
Ai^ w + = AR* 

if There is a node crossing: [^(R'V^^R) < 0] then 

Reject the move: nRejectTemp + + 
else 

Accept the move with probability: A(R^ -> R /(iu,) ;r) = min[l, Q(R (iw) 
R /(iw) )], where Q is defined in Eq. (4.81). 
if The move is accepted then 

Update the walker properties 

nAcceptTemp + + 

AR 2 accep ted+ = AR 2 
else 

nRejectTemp + + 
end if 
end if 
end for 

if All particle moves rejected then 
Increment the Age: Age^ iw ' + + 
else 

Update the walker properties 
end if 

Calculate the effective time-step: r e // = rA ^ cepted 
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Calculate the branching probability: P w = exp 
Reweight the walker: W {iw) * = P (iw) 
if The age of the walker is greater than one: AgeW > 1 then 

Rescale the branching probability: p( iw ) = min[l, p( lw )] 
end if 

Set the multiplicity: = + £, where f is a random number G [0, 1] 

Determine number walkers that advance to the next generation from the multiplic- 
ity: N^J es = INT[MW] 
if <t = then 

Kill the walker 
end if 
end for 

nAccept+ = nAcceptTemp, nReject+ = nRejectTemp 
for all walkers do 

Accumulate estimator for observable (A): 

A sum + = WMAW, W sum + = 
end for 

Perform branching. 

Reset the walker weig hts and multiplicities: W {iw \ M^ iw) = 1.0 
return The new population of walkers P 
Update the trial energy: E T , see Eq. (4.71). 
end for 

Write estimator to file and reset variables: A SU m, W sum = 
end for 



-T eff (E L (R^) + E L {R'W) - 2E T )/2 
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5.4 Parallelization of QMC 

The desirability of the QMC method is that it is fairly straightforward to parallelize and 
requires minimal inter-processor communication. Parallelization reduces the load on each 
processor and reduces the total computation time. In the context of QMC, parallelization 
means distributing the walkers among several processors. This is important because typical 
QMC simulations (especially DMC) may require several hundred walkers, with paralleliza- 
tion each processor is only responsible for a fraction of the total number. Parallelization is 
essential for the systems containing large numbers of electrons (such as the clusters simu- 
lated in this thesis); placing a large number of walkers on a single processor is unfeasible. 
With a parallel version of QMC it is possible to assign a manageable number of walkers on 
each processor while maintaining a large total walker population. 

The most trivial parallelization is to perform independent simulations on each processor 
and collate the results for statistical analysis. No inter-processor communication is required 
for this method, although it is necessary to assign different seeds to the random number 
generator for each of the processors. 

qmcPACK [3] also features versions of VMC and DMC that incorporate inter-processor 
communication. For VMC a fixed number of walkers are assigned to each processor and un- 
dergo independent random walks; inter-processor communication is required to collate the 
results (for estimators such as the local energy) at the end of each block. This method can- 
not be extended to DMC, which has a fluctuating walker population. The DMC algorithm, 
courtesy of the branching, depends on the total walker population over all of the individual 
processors. In addition to the total walker population, the trial energy Et, which is in part 
determined by the population, is also a global property. It is necessary to keep track of the 
global walker population and regulate the number of walkers on each processor. qmcPACK 
regulates the walkers per node by swapping walkers between individual processors: a maxi- 
mum difference of only one or two walkers is allowed to exist between the nodes. Appropriate 
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actions are taken if the global walker population becomes too large or too small. 



103 



Chapter 6 



The Quantum Monte Carlo Trial 
Wave Function 

6.1 Introduction 

The accuracy and statistical efficiency of any QMC simulation depends upon the quality of 
the trial wave function 1 The requirements on \&r are that it must be continuous and 
differentiable wherever the potential is finite, and the integral j dH^/xH^T must exist [31]. 
In order for v&r to describe a system of interacting fermions, it must obey particle exchange 
anti-symmetry 

*(xi, . . .,Xj, . . .,-Xj, . . . ,Xjv) = -^(Xi, . . . ,Xj, . . . ,Xi, . . . ,Xjv), (6.1) 

where Xj = {rj, c^} represents the space and spin coordinates of the ith electron. 

A specific form that explicitly includes correlation in a many-body sense is known as the 
S later- Jastrow type: 

tf T (X) = D(X)e- f/(x) , (6.2) 

where X = (xi, X2, . . . , xjv) denotes the spin and space variables for the iV-electrons, D 

is a Slater determinant and U is a Jastrow correlation function. The Slater determinant 

automatically satisfies the particle exchange anti-symmetry requirement when the Slater 

matrix elements are single particle orbitals, usually obtained from Hartree-Fock (HF) or 

1 It is necessary to distinguish between the definitions of accuracy and precision, and how they pertain to 
QMC. The accuracy is determined by the quality of the trial wave function - in other words the approximation 
used to construct the trial wave function. The precision is related to the statistical uncertainty, and is 
determined by the number of Monte Carlo steps and the statistical correlation between each Monte Carlo 
step. 
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Kohn-Sham density functional theory (DFT). The Jastrow [179] factor typically contains 
electron-ion, electron-electron, and even electron-electron-ion or higher order correlations. 
The Jastrow factor is positive and multiplicative, hence the nodes of \Pt are determined 
solely by the Slater determinant terms. 

For any spin-independent Hamiltonian H, it can be shown that the position space function 

vI> T (R) = Dt(n, . . . , r N ^(r Nr+1 , . . . , r N )e~ u( - R \ (6.3) 

gives the same expectation value as the spin-space function \]/y(X) in Eq. (6.2) [31]. This 
is true for any spin-independent operator and eliminates the need to sum over the spin 
variables. The spin- up electrons, by convention labeled (r 1; . . . , rjv T ), are treated as being 
distinguishable from the spin-down electrons (r/v T+1 , • • • , Tn)- One consequence of using the 
position space function in Eq. 6.3 is that opposite spin electrons are not antisymmetric upon 
exchange of their spatial coordinates. 

The most general form of the trial wave function in Eq. (6.3) is a linear combination 
of Slater determinants ^2iCiDjD^ . Multi-determinant trial wave functions typically arise in 
Configuration Interaction (CI) calculations, see Section 3.2. It also may be necessary to 
use a multi-determinant if the state must have a correct total spin S = X^S^) or angular 
momentum L = £\ L(i). A single Slater determinant is an eigenfunction of the z— component 
of the spin, S z = J2i S z {i), and angular momentum, L z = Y^iLz{i), but not necessarily the 
total spin or angular momentum. 

It is generally desirable to use the simplest possible trial wave function since the most 
computationally intensive part of any QMC simulation is evaluating the trial wave function 
and its derivatives. The following sections detail how to correctly evaluate the Slater- Jastrow 
trial function. 
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6.2 Evaluating the Slater Determinant 

The Slater determinant is the determinant of the Slater matrix and is denoted by 



D(R) = detGD(R)), 



(6.4) 



where the matrix is underlined to prevent confusion with the determinant. The elements of 
the Slater matrix for the electrons are from a set of N single-particle orbitals 



(6.5) 



We assume that the single particle orbitals are a function of a single coordinate. The Slater 
matrix can be represented by 



/ 



D(R) 



0i(ri) 2 (ri) 
</>i(r 2 ) 02(r 2 ) 



>M r iJ 
W(r 2 ) 



(6.6) 



y 4>i(r N ) 4> 2 (r N ) ■■■ 4> N (r N ) J 



for the configuration R = (ri, r 2 , . . . , tn). 

For the purposes of evaluating the Slater determinant [180], it is necessary to define the 
following quantities: 

A. The minor for the element Dij (of the N x N matrix) is the determinant of an 
(N — 1) x (N — 1) matrix obtained by removing all the elements of row % and column j. 

B. The co-factor matrix C_ is constructed on an element by element basis from the minor 
using the simple relation: 



Ci 
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C. The inverse of the matrix D_ obeys the following relation 

I = D^D, (6.7) 

where D^ 1 is the inverse matrix. The inverse is related to the transpose of the cofactor 
matrix by the relation 

(D-% = { ^f = % (6.8) 

where D is the determinant. 

Using these definitions, the determinant is evaluated as the sum of the products of the 
elements of any row or column of the matrix and the corresponding cofactor 

N 



D = ^^DijCij — ► along row i (6.9) 
i=i 

N 

= ^^DijCij — ► along column j. (6.10) 

i=l 

In quantum Monte Carlo it is necessary to evaluate the ratio 

^( R/ ) _ D ( R ') r -u(R')+u(R) (6U] 
*(R) D(R) ' 1 J 

where R' = (r 1; r 2 , . . . , r^, . . . , tn) differs from R = (ri, r 2 , . . . , r*, . . . , rjy) only in the coordi- 
nates of the ith particle: D can either be the up or down determinant depending on the spin 
of particle %. It is important to note that the underlying Slater matrix ZZ(R') is identical 
to i2(R) except for the ith row. Hence the elements of the cofactor matrix C_ along row % 
remain the same, since Cjj is constructed from all the other rows of the Slater matrix except 
row i; therefore C^- is not a function of the coordinate rv Using Eqs. (6.8) and (6.10), the 
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ratio of the determinate at R' to the determinate at R is [150] 

g D(R) D(R) 

TV 

= J2 D ^ R ') D J^ R )- ( 6 - 12 ) 

The reason for using this relation is that it eliminates the need to repeatedly evaluate the 
determinant for every Monte Carlo step, which takes an order of (9[iV 3 ] operations. If 
the move from R to R' is accepted (particle i), the inverse Slater matrix Z} _1 must be 
updated, which only requires OfiV 2 ] operations. The matrix is actually the inverse of 
the transpose of D_. The new inverse Slater matrix (at R') can be calculated from the old 
inverse matrix by the relation 

D£W) = \ lk{ , (6.13) 



Dj k i (R) - ^-/(R') E;Ii D-\-R)D a (R>), k ± i 

where q is the ratio from Eq. (6.12). 

The ratio for a multi-determinant wave function is 

Ei=i CiA(R') _ Ei=iCiftA(R) 



E j=1 qA(R) Ei=iCiA(R) 



(6.14) 



where ^ = A(R')/A(R) from Eq. (6.12). 

The derivatives are computed in a similar manner to the ratio in Eq. (6.12). To calculate 
the gradient with respect to particle i, expand the determinant in terms of its co-factors 
along row % and differentiate 

TV 

ViD(R) = X)(ViAi)CV (6.15) 
where Z)jj(R) is a function of r-j and is a function of all of the other coordinates except 
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rj. This leads to the important relations: 



V*P(R) 
D(R) 



(6.16) 



and 



D(R) 



£(V?fly(R))I#(R). 



(6.17) 



6.3 Single- Particle Orbitals 

The following sections describe the different types of basis functions used in this work to 
construct the single particle orbitals, including Slater Type Orbitals (STO)s and Gaussian 
Type Orbitals (GTO)s and Numerical Orbitals. For atoms these orbitals are commonly 
referred to as Atomic Orbitals (AO)s, while for molecules they are referred to as Molecular 
Orbitals (MO)s. The orbitals can be expressed in terms of a linear combination of basis 
functions centered at the sight of each atom. 

The advantage of STOs over GTOs is that they have a form which is closer to an atomic- 
like orbital and therefore better satisfy the cusp conditions at the atomic nuclei. Unfortu- 
nately, STOs are more difficult to integrate than GTOs and analytic formulas do not exist. 
2 This is the reason why GTOs are more commonly used in electronic structure codes. In 
general it is possible to fit a set of Gaussians to construct an STO; this approach is used to 
construct the basis for the hydrogen atom. 

6.3.1 Gaussian Type Orbitals and the Gaussian 03 Basis 

In the Gaussian03 package [11] the molecular orbitals are written as a linear combination of 
local orbitals 



2 In Kohn-Sham DFT or Hartrcc-Fock methods the integrals are matrix elements of the Hamiltonian. 




(6.18) 
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where the sum i runs over all the atoms and the sum j runs over all the local orbitals 
centered on atom % at position Rj. The form of the local orbital is the sum of one or more 
primitive Gaussians multiplied by a polynomial and a real spherical harmonic 



N 



^^V"^ 2 » ( 6 - 19 ) 



i=i 



where N is the number of primitive functions, i.e. the degree of contraction, the {c^} are 
the contraction coefficients, the {a{\ are the exponents, is a scale factor common to all 
exponents and Si m is a real spherical harmonic (see Appendix A. 1.1). The factor allows 
for more variational freedom of the basis but for our applications it is usually set to one or 
can be absorbed into the 

It is convenient to factor out of the contraction coefficients a normalization for each 
primitive Gaussian: q = iVj(aj)&j. The normalization factor Ni(oii) is determined by the 
requirement 

1 = (6.20) 
since the Si m are normalized this condition gives 

poo 

l = Nf(a) / drr 2 ^e- 2ar \ (6.21) 
Jo 

The above integral can be solved by using the formula 



2n„-ar 2 (2fl - 1)!! Ux 







drr 2n e- ar = v - J -, (6.22) 



Ni(a) = 2 l+ \ — — - — . (6.23) 



which yields 

(21 + 1)!! V vr 

Gaussian basis sets are often expressed in terms of shells [11]. A shell is defined as a set 
of basis functions {(f)} with shared exponents {a}- Examples of shells are s,p, sp, d, spd, 
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An s-shell contains a single basis function with angular momentum I — 0; a p-shell contains 
three basis functions oip x ,p y a,ndp z . Another example is the sp-type shell which is comprised 
of four basis functions s,p x ,p y and p z . For the sp shell the contraction coefficients must be 
the same for all functions of a given angular momentum. The basis sets used in this thesis 
are generally constructed of s, p, sp and d shells. 



where n is the principal quantum number, £ is an exponent and Sj m is a real spherical 
harmonic (see Appendix A. 1.1). Following an identical procedure in Section 6.3.1, the nor- 
malization constant N n (£) is determined by the condition 



6.3.2 Slater Type Orbitals 



The form of the Slater type orbital (STO) is given by 



(6.24) 



1 



(4>nlm\4 ) nlm,) 



(6.25) 




(6.26) 



The integral can be solved by using the formula 




(6.27) 



yielding the result 




(6.28) 
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6.3.3 Numerical Orbitals 

Numerical orbitals refer to any type of orbital on a grid. For a function on a radial grid, 
/(r»), with the endpoints T{ and r/, it is possible to evaluate the function and its derivatives 
at any point r, not just at the grid points {r^} by use of an interpolation function. The 
interpolation function used in qmcPACK and the radial grid HF codes is the one-dimensional 
cubic spline discussed in Appendix A. 3. For a discussion of the different types of radial grids 
see Appendix A. 2. 

Any analytic function, such as a GTO or an STO, can be discretized on a radial grid 
and interpolated. It can actually be faster to evaluate the functions numerically, by use of 
the cubic splines, than analytically. Many electronic structure codes such as the SIESTA 
DFT package [181] and our radial HF code use numerical orbitals. For a discussion the 
SIESTA Numerical Atomic Orbitals (NAO)s see Ref. [182]. Typically pseudopotentials are 
also evaluated on a radial grid. It is also possible to interpolate a three-dimensional grid 
function by use of a three-dimensional cubic spline. Many electronic structure codes use 
three-dimensional grid functions, and it is also possible to discretize the molecular orbitals 
from the Gaussian03 package [11] on a three-dimensional grid. For a discussion of three- 
dimensional cubic splines see Ref. [183]. 

6.4 The Electron-Electron Jastrow 

The electron-electron Jastrow [179] correlates pairs of electrons 



i<j 

where u l J e is a generic function that depends on the relative spins of the electron pair % and 
j, and rij = |r$ — rj\ is the relative electron-electron coordinate. For the sake of simplicity 




(6.29) 
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all of the derivations of this section assume a spin-independent Jastrow function 



U eei r ij) Uee{rij). 



(6.30) 



The electron-electron cusp condition (see Appendix A. 7. 2) must be satisfied by the 
electron-electron Jastrow. For a trial function of the form \1/ = exp(— U ee ), the correct 
cusp condition is: 



1 



$ dr. 



i.i 



dU P 



| for opposite spins 
\ for parallel spins 



(6.31) 



In practice usually only the opposite spin cusp condition is satisfied since like spin electrons 
are unlikely to approach each other by exchange interactions. 

6.4.1 The Derivatives 

To evaluate the gradient and Laplacian the following identities are useful: 



V fc (r. 



ik) 

rik 

Tik, 
Tik 



r ifc 

r ik 
2 

Tik 



(6.32) 

(6.33) 
(6.34) 



and the vector product rule 



V-(/A)=/(V-A)+A.(V/). 



(6.35) 
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The first and second derivatives of the function u ee are denoted by u' ee and u" e , respectively. 
The gradient 



v k u, 



ee 



k-1 



N 



u ee {r ik ) + 22 U ee(r k i 



1=1 



i=k+l 



rki 



i<k 



r ik 

-^2 U 'ee{nk)^; 

.,, ' ik 

i^k 



i>k 



Tki 



(6.36) 



and the Laplacian 



E 
E 

i^k 



-Ke( r ik) + U" e (r ik ) 



- ) ■ (V fc «U(r*)) 



Summing over all yields 



2 E 



r ik 



(6.37) 



6.4.2 The Ratio 



In QMC it is necessary to evaluate the ratio 



(6.38) 
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where R' = (r 1; r 2 , . . . , r' k , . . . ,tn) differs from R = (ri, r 2 , . . . , r^, . . . , r/v) only in the coor- 
dinates of the kth particle. The ratio can be shown to be equal to 

r = exp I ^ KeOifc') - u ee (r ik )} J , (6.39) 



where r ik i — |r, — rCJ. 



6.5 The Electron-Ion Jastrow 

The electron-ion (single-electron) Jastrow correlates the electrons to the ions 

U eI (R)=J2<i(n«), (6.40) 

ia 

where u®j is a generic function that depends on the ion species and r ia — |rj — R a | is the 
relative electron-ion coordinate. Note that the electron coordinates are denoted by r^, while 
the ion coordinates are R a . Besides electron- ion correlations, the single-electron Jastrow has 
been successfully used in quantum dot (QD) calculations where the coordinates R a would 
represent the centers of the QD potentials. 

The electron-nuclear cusp condition (see Appendix A. 7.1) can be satisfied by the single- 
electron Jastrow. The electron-nuclear cusp condition is discussed in great detail by Ma et 
al. [184]. In practice we are only interested in the cusp condition for the s (I — 0) electrons. 
For a trial function of the form \1/ = exp(— U^j), the correct cusp condition is: 



1 



* dri 



dU eI 



dr ia 



Z a , (6.41) 



where Z a is the charge of the ion. This is only necessary when the single-particle orbitals 
in the Slater determinant do not satisfy the correct nuclear cusp condition for the given ion 
species. If the bare Coulomb potential V a (r) = —Z a /r of ion I is replaced by a pseudopo- 
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tential there is no cusp condition at R Q . 

The introduction of the single-electron Jastrow term serves to correct energetically un- 
favorable changes in the density caused by the electron-electron Jastrow correlation term 
[7, 31, 185]. In non-uniform systems the electron-electron Jastrow pushes electrons away from 
regions of high charge density into lower density regions. The introductions of the single- 
electron Jastrow alters the charge density without disturbing the pair-correlation function 



6.5.1 The Derivatives 

To evaluate the gradient and Laplacian of U e j, define the first and second derivatives of u 
as u'^j and u"f, respectively. 



[31]. 




(6.42) 



Q 



and the Laplacian 



E ( V * ■ (^)) + (^) ■ (V*«5(r fa )) 




a 



(6.43) 



6.5.2 The Ratio 



In QMC it is necessary to evaluate the ratio 



r = 



exp(t/ eJ (R')) 
exp(C/ e/ (R)) 



exp(U eI (R')-U eI (R)), 



(6.44) 
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where R' = (r 1; r 2 , . . . , r' k , . . . ,tn) differs from R = (ri, r 2 , . . . , r^, . . . , r/v) only in the coor- 
dinates of the fcth particle. The ratio can be shown to be equal to 



r = exp ( Ki(n>, 

\ a 



<i{r ka )} , (6.45) 



where r k , a = \r' k - RJ. 



6.6 Examples of Jastrow Functions 

The following sections contain different types of functions that are used for the one- and 
two-electron Jastrow correlation factors. 

6.6.1 The Pade Jastrow 

The generic form of the Pade- Jastrow [7] is given by 

air + a 2 r 2 + ■■■ 
M ( r ) = i T u i t 2 i ' ( 6 - 46 ) 

where the most common form has only non-zero a\ and b\ : 



u(r) = — ¥ , (6.47) 



with the derivatives 



«V) = 77^7777 ( 6 - 48 ) 



(1 + br) 2 

-lab 
[l + brf 



= 77^77' (6-49) 
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Figure 6.1: Pade-Jastrow u(r) with a = 1/2 and several different values of b. 

The asymptotic conditions are: 

«(r)| r=0 = 

«'( r )lr=0 = a 

To satisfy the electron-electron cusp conditions the parameter a is set to either —1/2 for 
opposite spin electrons or —1/4 for parallel spin electrons. For the case of a nuclear cusp 
condition a — Z y where Z is the charge of the nucleus. A plot of the Pade-Jastrow for several 
values of the parameter b is provided by Figure 6.1. 

6.6.2 The Cusp-less Pade-Jastrow 

The cusp-less Pade-Jastrow function is ideal for the cases where no cusp condition is required 
such as for harmonic potentials or pseudopotentials: 

u(r) 
u'(r) 

u"{r) 



1 + br 2 
—2abr 

(1 + br 2 ) 2 

2ab(3br 2 - 1) 

(1 + 6r 2 ) 3 



(6.50) 
(6.51) 

(6.52) 
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Figure 6.2: Cusp-Less Pade-Jastrow u(r) with a = 1.0 and several values of b. 
The asymptotic conditions are: 



U ( r )\r=0 



u[r) 



a 




(6.53) 



u'(r)\ r=0 = 0. 



A plot of the cusp-less Pade-Jastrow for several values of the parameter b is provided by 
Figure 6.2. 

6.6.3 The Long-Ranged Jastrow 

This function is generally used for the electron-electron Jastrow for periodic systems [186]: 



u{r) 
u'(r) 

u"{r) 



-[1 — exp(— r/b)} 



2a 



— exp(— r/b) — -exp(— r/b) 
b 

1 - exp(-r/6) - - exp(-r/6) - — exp(-r/6) 



(6.54) 
(6.55) 

(6.56) 
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The asymptotic conditions are: 



u{r)\ r=0 = a/b 

u(r)\ r ^ 00 = a/r (6.57) 
u'(r)\ r=Q = -a/2bi 

The parameters a and b are not independent; to satisfy the opposite spin electron-electron 
cusp condition b = y/a, while for parallel spin electrons b = \r2a. 

6.7 Evaluating the Local Energy 

This section detailed how the local energy is evaluated for a trial wave function \I/t(R)- The 
local energy 

(6 - 58) 

is defined with respect to the N— electron Hamiltonian 

H = -^V 2 + V(R), (6.59) 

where V 2 = Ylf=i * s the N— electron Laplacian and the potential ^(R) normally consists 
of an electron-electron interaction V ee (|rj — rj\) and an electron-ion interaction which is due to 
an external potential V ext (\ri — R Q |). For this section we assume that V ext is a local potential, 
the evaluation of non-local potentials is presented in Section 7.6. 

To evaluate the local energy it is convenient to differentiate the logarithm of the trial 
function 

Vexp(ln^) = Vln^exp(ln^) (6.60) 

= [V(ln*)]W T 
V 2 exp(ln*) = [V 2 (ln^) + (V(ln*)) 2 ]* T . 
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The kinetic energy (KE) part of the local energy 



KE = --Wy X V 2 * T = - j [V 2 (ln^) + (V(ln^)) 2 ] . (6.61] 



can be expressed in terms of the quantities 



1 „ lV 2 f r 1 /V^r\ 2 

T = - V(lntt) = -~ + 7 -r 1 ( 6 - 62 ) 

and 

F = ^ v(h, * ) = ^- (6 ' 63) 

where the relation is given by 

KE = 2T — |F| 2 . (6.64) 

The 3N— dimensional vector F is closely related to the drift velocity Vd = ^^V^t at the 
position R. By use of the Green's Theorem and integration by parts it is possible to derive 
the relation: 

{KE) = (T) = (|F| 2 ), (6.65) 

where the averages ( ) are computed with respect to ^>\. The relationship in Eq. (6.65) can 
be used to check if is sampled correctly and if \&t and its derivatives are being correctly 
evaluated, see Figure 6.3 for an example. 

The gradient and Laplacian of In \I>, when ^r(R) = D(H) exp(— Z7(R)) is Slater- Jastrow 
type, are given by 

VD 

V(lntf) = -VU + — (6.66) 



and 

\7 2 rin ib\ = -X7 2 TJ + - 

D V D J 



V 2 (ln*) = -V 2 t/ + ^-^y, (6.67) 
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Figure 6.3: The VMC estimator of (T) — (|F| 2 ) for the Li2 molecule. 



where V is the 3 N— dimensional gradient. The final expression for the local energy is: 



Er, = -- 



V 2 D 
D 



V 2 U+(VU) 2 - 2VU 



VP 



(6.68) 



Expressions for evaluating the derivatives of the determinant and Jastrow parts (electron- 
electron and electron-ion) of \I/t can be found in Sections 6.2, 6.4 and 6.5. 
For a multi-determinant trial wave function the gradient and Laplacian are: 



T M c D 



Vln* = -VU 



(6.69) 



and 



V 2 ln* 



_ V 2 V , r: =l C n V 2 D n (En =1 CnVD n 



T M c D 



Y M C D 



(6.70) 
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6.8 Optimization of the Trial Wave Function 



6.8.1 The Cost Function and Correlated Sampling 

The Slater- Jastrow trial function ty? is parameterized by the set of variational parameters 
a. The goal of optimization is to find the set ex. that minimizes a cost function. A natural 
choice for the cost function is the expectation value of the local energy 

Ev{a) = /dR**(R;a) ' ^ 

where El(ol) = (a) Hty t(ck) • Minimizing the variance of the local energy 

fdR* 2 T (R;a)[E L (R; a) - E v {cx)f 
^ = /dR**(R;a) (6 ' 72) 

is desirable due the variance minimization principle 

a 2 [H] = (H%> - {H)%, t > (6.73) 

which states that the absolute minimum value of the variance is zero. Since the absolute 
minimum of Eq. (6.72) occurs when \&y is any eigenfunction of H, variance minimization is 
suitable for excited states as well. In practice excited state wave functions are usually not 
optimized. It is also possible to optimize a linear combination of Ey and a^, but it should 
be noted that the energy and variance have different units. 

A variant of Eq. (6.72) involves minimizing the second moment of the local energy with 
respect to a fixed reference energy Er : 



2 /rfRg£(R;qQ [E L (R; a) - E R ] 

= /dR**(R;a) ' ^ 
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where the constant E R should be close to the energy of the state being sought. 3 
In general it is possible to minimize any function of the form 

JdRPjR; a)f(E L (R;a)-E R ) 

JdRP(R;cx) ' [b ^ b) 

where P is a probability distribution and / is a non- negative function such that /(0) = 
[187]. qmcPACK can optimize the class of functions 

f(E L (R; a) - E R ) = \E L {R] a) - E R \ n , (6.76) 

for any integer n > 1 over the distribution P = ^(a). 

The cost function is computed as an average over a finite set of configurations generated 
by VMC. A new set of configurations is required each time ct is changed in the optimiza- 
tion process; this is necessary to correctly sample ^(at). By using the correlated sampling 
approach (Section 4.8), only a single set of configurations from the distribution ^x(, a o) 2 
is required, where cko is the initial parameter set [155]. The cost function in Eq. (6.74) is 
evaluated as: 

2 / dR g|(R; a)w(R; a) (E L (R; a) - E R f 



where the weights are 



w(R; a) = * T ffi°° v (6.78) 
W T (R; a ) 



and ck is the current parameter set. The discrete approximation to Eq. (6.77) is given by 



2 _ ESi w(gjj <*) (^l(R»; a) - gg) , R 7m 

J2 i=1 w[Ri; oc) 

where the number of configurations M is usually of the order 10 3 — 10 5 . 

An improved method for optimization when calculating extrapolated estimators (Sec- 



Minimizing Eq. (6.74) is equivalent to minimizing a linear combination of Ey and a\, for a proof see 
pg. ofRef. [7]. 
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tion 4.10.4) is to maximize the overlap between the trial wave function and the projected 
wave function $ (J3 -> oo) within DMC [188] 

(6.80) 

[/dR*|(R;a)] 1/2 

The actual minimization of the cost function can be carried out by using standard opti- 
mization techniques. The most common methods are conjugate gradient, steepest descent 
and simulated annealing. The default method of qmcPACK is the conjugate gradient algo- 
rithm from Numerical Recipes [189]. Algorithms that require analytic evaluation of the cost 
function and its derivatives are avoided. The parameters a are varied until the convergence 
condition on the cost function is satisfied. It may be necessary to introduce constraints on 
a to prevent divergences in trial function ^ T (a) and the local energy E L (ct). 

6.8.2 Optimizing the Jastrow 

Usually only the Jastrow part of the Slater- Jastrow trial wave function is parameterized: 

* T (R; a) = D(R) exp[-C/(R; a)]. (6.81) 

When this is the case the weights, Eq. (6.78), are determined by the ratio of the Jastrow 
factors: 

w(R; a) = exp[-C/(R; a) + U(R; a )]. (6.82) 

Only the terms in the local energy, Eq. (6.68), involving the Jastrow need to be re-evaluated 
during the optimization. For an example of the optimization of the Jastrow for the Ge atom, 
see Figure 6.4. 

In general it is possible to optimize the single particle orbitals in the Slater determinant if 
they are parameterized. This is idea behind the correlated geminal approach used by Casula 
et al. [190]. Also it is possible to optimize the coefficients of the Slater determinants for 
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Optimization Steps 



Figure 6.4: Optimization of the ground-state of the Ge atom ( 3 P) using a cost function 
which a linear combination of Eqs. (6.71) and (6.72). The optimizable parameters are in 
the one- and two-electron Jastrow functions; for details regarding the trial wave function see 
Section 8.2. a) Plots the energy for each step of the optimization and b) plots the variance of 
the local energy. The optimization algorithm used in this example was conjugate gradient. 
All units in a.u. 

multi-determinant trial wave functions. 

6.8.3 Optimization in the Presence of a Non-Local Potential 

For a local potential, the potential energy (PE) is independent of the trial wave function 
and depends only on the positions of the particles. In contrast, for non-local potentials, the 
potential energy depends explicitly on the trial wave function, see Eq. (7.20). Typically the 
non-local potential is split into a local part common to all angular momentum channels and 
an angular-momentum dependent correction to the local part. Since the non-local corrections 
only constitute a small part of the total energy, it is not necessary to recalculate the potential 
part of the local energy during the optimization. This has the effect of saving a considerable 
amount of computation time [191]. 
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6.8.4 Numerical Instability 

The optimization procedure outlined above is often successful, but can be numerically unsta- 
ble. The optimization can reach a minimum for a parameter set at that gives worse results 
for subsequent QMC calculations than the original parameter set olq, in terms of both the 
local energy and the variance of the local energy. The origin of this problem can be traced to 
the presence of walkers acquiring a large weight, usually only a single walker, which ruin the 
minimization [187]. In principle this problem could be remedied by increasing the number of 
walkers, but this is often impractical and the number required can be prohibitively large. A 
method for dealing with the presence of bad walkers in the finite walker set is to determine 
the effective number of walkers 



where the total number of walkers is N. If all the weights are close to unity the effective 
number of walkers is close to N, N e ff ~ N : in the extreme case where all the weight is 
concentrated in one walker, the effective weight is close to unity, N e ff ps 1. It is advisable to 
terminate the optimization if N e ff is less than a cutoff, which is usually around 80% of the 
total number of walkers. The problem can usually be traced to using a poor guess for cto or 
an insufficient number of walkers. 

Numerical instability in the weights is especially prevalent for systems containing large 
numbers of electrons such as molecular clusters and bulk materials. We found this to be 
directly relevant to the Hydrogen passivated Si and Ge nanoclusters larger than Si2H 6 or 
Ge2H6- Fortunately, in the variance minimization scheme it is possible to alter the weights 
while not affecting the positions of the minima. Constraining the weights of the variance to 
be unity [192, 31, 51] (w = 1) yields 



N, 



eff - 



(6.83) 




/ dR gfffi op) [E L (R; a) - E c (cx)] 2 
j dR ^! 2 T (R; a ) 



(6.84) 
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where the un-weighted energy is 



EMol) 



J dR #|(R; a )E L (R; a) 
J dR ^! 2 T (R; a ) 



(6.85) 



The absolute minima of Eq. (6.84) at zero occurs if ^t{ol) is an exact eigenfunction of H, 
because for an exact eigenfunction El = Eq- Therefore, the absolute minima of Eqs. (6.84) 
and (6.72) are at the same positions, but Eq. (6.84) is advantageous since it has a lower 
variance. The function Ec does not share this property, the ground-state of H is not guar- 
anteed to be the minimum value of Ec- For an example of constrained weight optimization 
of the Ge2gH 36 nanocluster, see Figure 6.5. 




1000 1500 
Optimization Steps 



Figure 6.5: Optimization with constrained weights of the ground-state of the Ge2gH36 nan- 
ocluster using the variance as the cost function. The optimizable parameters are in the 
one- and two-electron Jastrow functions; for details regarding the trial wave function see 
Section 8.2. a) Plots the energy for each step of the optimization and b) plots the variance 
of the local energy. 
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6.8.5 Outline of the Algorithm 

This is the outline of the algorithm to optimize a trial wave function ^(a), see Algorithm 2. 
As a first step it is always necessary to generate a set of walkers from VMC to sample the 
initial distribution ^(cko) 2 - 

It is always advisable to first perform an optimization on a simpler problem before pro- 
ceeding to the more difficult problem under consideration. For example, to optimize a small 
molecule such as GeH 4 it is recommended to first optimize the trial wave functions for 
the constituent Ge and H atoms. The atomic results can be used as a good initial guess 
for trial wave function for GeH4. To optimize a larger Ge structure, such as GesH^, it is 
recommended to use as an initial guess the results from the optimization of GeH 4 . 

Algorithm 2 Optimization of the trial wave function ^(a). The value of x used to deter- 
mine the cutoff is usually chosen to be 0.8. 

Generate a set of N configurations from the distribution ^(cuo) 2 - 

repeat 

Evaluate the Cost function (For example Eq. (6.77)). The weights are either evaluated 
by Eq. (6.78) or are set to 1.0. 

Calculate the effective number of walkers N e ff, Eq. (6.83). 
if N eff < xN then 

Stop the optimization, 
end if 

Adjust the parameters using algorithm of choice: oli — > a i+ i 

Update the trial function: \J/ T (a:j) — ► ^(oij+i). 
until Convergence criteria of the algorithm is satisfied. 
If (Xi differs significantly from ctQ it is advisable to rerun VMC for ^(aij). 
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Chapter 7 

Core- Valence Partitioning 



7.1 Introduction 

Core-valence partitioning schemes allow electronic structure calculations only on valence 
electrons by quantifying the effects the core electrons have on the valence electrons. Such 
schemes are successful because the outer valence electrons determine most of the chemical 
properties of atoms and molecules while the core electrons are relatively inert. The most 
common implementation of core-valence partitioning is by a pseudopotential, or effective 
core potential. Pseudopotentials are desirable because they allow the elimination of the 
core electrons, hence reducing the effective atomic number, and feature valence orbitals that 
are smooth and node-less. In the context of many-body methods such as Quantum Monte 
Carlo (QMC), pseudopotentials enable calculations that would otherwise be unfeasible. The 
scaling of QMC algorithms with respect to the atomic number Z is highly nonlinear because 
shorter time steps and longer runs are required for the higher energy core states. Estimates 
vary [33, 7] from Z 5 - 5 to Z 6,5 for all-electron calculations. Typically pseudopotentials are 
constructed within single-body methods such as density functional theory (DFT) in the 
local density approximation (LDA) or Hartree-Fock (HF), which means that the core- valence 
interactions are treated at the same levels. While this approach is sufficient for most elements 
in the periodic table, where core-valence interactions are negligible, it is unsatisfactory for 
post-transition elements such as Ge which has shallow and easily polarizable 3d core. 

The salient issue is that the accuracy of QMC applied to heavy atoms such as Ge [Z = 32) 
is limited by the treatment of the core-valence interactions. No matter how accurate the 
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QMC calculations for the valence electrons, the accuracy is limited by the pseudopotential. 
Therefore, an improved method of core-valence partitioning is necessary for applications of 
QMC to elements such as Ge. 

Shirley and Martin [43] have implemented a many-body version of core-valence parti- 
tioning based on the core polarization potential [40, 41, 42] (CPP) approach. Their parame- 
terized CPPs are fit to quasiparticle self-energies, which include many-body effects such as 
core-relaxation and polarization and core-valence correlation. This is significant for alkali- 
metals and post-transition elements such as Zn, Ga and Ge with shallow cores, core-valence 
correlations cannot be ignored. Specifically, the CPPs are purely ah initio and correct a HF 
treatment of core- valence interactions [43]. 

The effect of core- valence and intershell correlation is especially pronounced on the band 
structure of Ge. Shirley, Zhu and Louie [9] found significant improvement in the theoretical 
quasiparticle band energies of Ge when core-valence correlation was included via a CPP, 
see Table 7.1. Their CPP based results predicted the correct ordering of the closely spaced 
conduction band states, including the V — L indirect band gap. The results based on an 
LDA treatment of the core- valence interactions were not nearly as accurate, despite the fact 
that the valence-valence interactions were treated at the same level of accuracy. 1 

For the Ge2H 6 molecule, Nicklass and Stoll (NS) [195] determined that core-valence cor- 
relation is a significant effect in determining the geometry, especially the Ge-Ge bond length. 
NS were able to account for the influence of intershell correlation by including the 3d or- 
bitals in the active space of all-electron CCSD calculations. These results were corroborated 
by pseudopotential ones, where the core-valence correlation effects were modeled by a CPP 
[196, 197]. 

: Ku and Eguiluz [193], in a paper reporting all-electron quasiparticle calculations for Ge, have attacked 
the work of Shirley et al. [9] due to their treatment of the core electrons. The authors of the earlier work 
have replied with criticism that Ku and Eguiluz did not treat the valence-valence correlations well enough 
[194]. 
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Quasiparticle Results 
Core- Valence Interactions: 







LDA 


LDA 


HF+CPP 


Expt. 


- 


-r 7c 


-0.26 


0.53 


0.85 


0.89 


- 




0.55 


1.28 


1.09 


l.lOf 


- 


L§ c 


-0.05 


0.70 


0.73 


0.744 


^8v - 




0.60 


0.58 


0.36 


0.36 



Table 7.1: Comparison of band energy differences for Ge from LDA and quasiparticle results 
to experiment from Ref [9]. The quasiparticle results feature two different treatments of the 
core- valence interactions: LDA and a CPP based treatment. The fundamental gaps are in 
bold. All energies in eV. Experimental results from Ref. [10] unless otherwise noted, 
f From Ref. [9] and references therein. 

7.2 Introduction to Pseudopotentials 

It is possible to remove the core electrons by introducing a pseudopotential (PP), or an 
effective core potential according to quantum chemists. The use of pseudopotentials is 
justified since chemical properties, such as bond strengths, ionization energies and molecular 
geometries are largely determined by the valence electrons: the core electrons are spectator 
electrons and the core states retain their atom-like character. The disparity between the 
energy scales of the core electrons and valence electrons provides further justification for 
this decoupling [198, 40]. 

The effect of using a pseudopotential for a given atomic species is to reduce any electronic 
structure calculation to only the valence electrons. For the case of Ge, this reduces the 
calculation from 32 electrons to only 4 valence electrons: 

(ls) 2 (2s) 2 (2p) 6 (3s) 2 (3p) 6 (3rf) 10 (4s) 2 (4p) 2 -> (4s) 2 (4p) 2 . (7.1) 

Before going any further it is important to discuss an important aspect of pseudopoten- 
tials known as transferability. For a pseudopotential to be truly useful, it must not only 
mimic the atoms full-atomic valence properties for a given atomic reference configuration, 
but should also be transferable to different chemical environments, such as excited atomic 
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configurations, or more importantly molecules and solids. 

Our interest is mainly confined to a class of ab-initio pseudopotentials known as norm- 
conserving pseudopotentials (NCPP)s. NCPPs are constructed to be equal to the all-electron 
(full-atomic) potential outside the core region, but are shallow and are typically angular mo- 
mentum dependent inside the core region. The angular momentum dependence allows the 
pseudopotential to bind pseudo-valence orbitals with eigenvalues equal or nearly identical 
to the full- atomic eigenvalues for each angular momentum channel of interest. In addition, 
NCPPs are constructed to have nearly identical scattering properties as the full-atomic po- 
tential over a reasonably wide energy range for each angular momentum channel of interest. 
As we will see these properties make NCPPs highly transferable. 

Due to the angular momentum dependence, the pseudopotential takes the form of a 
non-local operator; the semi-local form is given by 



where Y\ m is a spherical harmonic and V\{j) is a radial function. The pseudopotential is 
termed as being semi-local because it is only non-local in the angular variables (6, 0), but it 
is local in the radial variable. 

7.2.1 Norm-Conserving Pseudopotentials 

Norm-conserving pseudopotentials feature valence pseudo-orbitals that are smooth and node- 
less, and when normalized become identical to the all-electron orbitals beyond a core radius 
r c . For an excellent review of NCPPs see Refs. [199, 200, 201, 202, 203, 29, 204]. 

Hamann, Schliiter and Chiang (HSC) [200] listed a set of criteria for good NCPPs. (It 
should be noted that the principle quantum number n is omitted; the all-electron orbital is 
designated ip AE and the pseudo oribital is i\) pp . Also, it is assumed that the valence pseudo 
orbitals satisfy the usual orthonormality conditions: ip pp \ip pp ) = 5ij.) 




(7.2) 



i a i 
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The HSC criteria: 
A. The all-electron and pseudo eigenvalues agree 



AE PP 



(7.3) 



B. The normalized all-electron and the normalized pseudo orbitals agree beyond the radius 



r c , which is known as the core-radius 



i ) i LE { r ) = ' l Pi' F ( r ) f° r f > r c . 



,pp 



(7.4) 



C. The partial- norm integral from to r c , i.e. the integrated charge density, of the all- 
electron and pseudo charge densities agree beyond r c . 



drr 2 \^f E (r)\' 



drr 2 \tf p (r)[ 



(7.5) 



This is the norm conservation condition. 

D. The logarithmic derivatives and the first energy derivative of the logarithmic derivative 
of the all-electron and pseudo wave functions agree at r c 



d 
dr 



d d 
de dr 



In %j) 



pp 



d_ 
dr 



In 



AE 



In ij} 



pp 



d_d_ 

de dr 



In ij} 



AE 



(7.6) 



(7.7) 



The radial logarithmic derivative is related to the scattering phase shift. 

From properties 1 and 2 if follows that the NCPP equals the AE potential outside the 
core region because the potential is uniquely determined by the wave function and the energy 
e (not necessarily an eigenvalue). Property 3 guarantees, through Gauss theorem, that the 
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electrostatic potential produced outside of r c is identical for the all-electron and pseudo 
charge densities. This property holds despite the fact that ipf p and ijjf E differ inside the 
core region. Property 4 guarantees that the PP and AE potential have the same scattering 
phase shifts (at r c ) to linear order in energy around e^. This is important because changing 
the chemical environment shifts the eigenvalues away from the atomic energy levels from the 
reference configuration. 

Properties 3 and 4 are crucial for the pseudopotential to have the optimum transferability 
and are related by the important identity 



where ip is a solution to the radial Schrodinger equation with the energy e (not necessarily an 
eigenvalue). Eq. (7.8) implies that if ipf p and ipf- E have the same magnitude at r c and the 
same integrated charge density inside a sphere of radius r c , then the first energy derivative 
of the logarithmic derivative are the same for ipf p and ipf E . For a proof of the identity in 
Eq. (7.8), see Appendix A. 8. In addition Shirley et al. [205] observed that for NCPPs the 
total charge inside r c divided by ip(r c ) 2 , 



is the same for ip and ip . 

7.3 Generating Pseudopotentials 

Pseudopotentials are normally constructed to reproduce the all-electron results of the valence 
electrons of an atomic reference configuration: such as eigenvalues, partial-norms and the 
radial logarithmic derivatives. Typically the reference configuration is the ground state of 
the atom, although it is possible to use an excited state configuration or even an ionized 




(7.8) 




(7.9) 
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state. There are many different procedures for generating norm-conserving pseudopotentials. 
One of the most popular and widely used schemes is by Troullier and Martins [206]. 

One of the most difficult parts of generating any pseudopotential is in choosing the 
matching radii r c for each of the angular momentum channels. For NCPP, r c must be larger 
than outermost node of the orbital - this represents a lower bound for r c . Generally r c is 
chosen to be near the outermost peak, the larger the r c the softer the pseudopotential, 
but this comes at the price that the pseudopotential is less transferable to different chemical 
environments [204, 199]. The softness of a pseudopotential refers to how many basis functions 
or plane waves are required to describe the pseudopotential. 

7.4 The Germanium Pseudopotential 

For Ge we use a relativistic Hartree-Fock (HF) pseudopotential from Ref. [12] to replace 
the core electrons, including the 3<i-shell. This pseudopotential was used for all of our 
Gaussian 03 and QMC calculations unless otherwise specified. We have also created a local 
pseudopotential for Ge which is discussed in Chapter A.ll. The pseudopotential reduces 
the calculation for each Ge atom from 32 electrons to only four valence electrons: 4s 2 4p 2 . 
The main reason for this choice of pseudopotential is that the SM CPP was developed as 
an addendum to HF based treatments of the core-valence interactions. In addition, there is 
also a growing body of work that shows that HF pseudopotentials perform better than DFT 
pseudopotentials in QMC [207], but it should be stated that this work is not conclusive. 

The details in the generation of the pseudopotential can be found in Refs. [12, 208] and are 
briefly summarized in Ref [209] . The pseudopotential was generated in the Troullier- Mart ins 
[206] scheme. For use in non-relativistic calculations it is necessary to perform a j weighted 
averaging. This is because the valence electrons are specified by the quantum number k; the 
relationship between K,j and / is summarized in Eq. (2.114). Therefore there are different 
pseudopotentials V K with the same I values. Taking the average of the pseudopotentials with 
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the same I quantum numbers weighted by the different j degeneracies leads to: 



Vi = zf-iW*=i + + l)K=-i-i]- (r.io) 



The spin-orbit coupling can be included separately. 

It was necessary to remove the "extreme non-locality" of the pseudopotential by slightly 
relaxing the norm-conservation condition. The extreme non-locality refers to the long-range 
behavior of the pseudopotential: the pseudopotential is non-local over all space and deviates 
from the Coulomb —Z/r potential in the r — ► oo limit. This is a consequence of the non- 
locality of the exchange interaction in Hartree-Fock (Dirac-Fock) . Figure 7.1 plots the Ge 
pseudopotential. 

7.5 Pseudopotentials in Gaussian 

The form of the pseudopotential used in the Gaussian 03 [11] package 

V PP = Viae + Vnl, (7.11) 

where Vi oc is the local component and Vnl is the non-local correction to Vi oc . The pseudopo- 
tential is represented in terms of a Gaussian expansion by: 

M 

V loc (r) = J2r nk c k e- a * r \ (7.12) 



fe=i 



and 



Imax 



Vnl = J2 



1=1 



Mi 

(7.13) 



_fc=i 

where the Ck are expansion coefficients and the operator Pi = \Yi m )(Yi m \ is the angular 
momentum projector operator. 
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7.6 Pseudopotentials in Quantum Monte Carlo 

7.6.1 Introduction 

Current QMC methods are largely intractable for heavy atoms due to the poor scaling of 
the computation time with respect to the number of electrons. The core electrons contribute 
most to the total energy and require smaller time steps to properly sample, due to the small 
variations of the wave function near the nucleus. For an atom of atomic number Z, estimates 
for the scaling range from Z 5 ' 5 [33] to Z 6 ' 5 [7] for all-electron calculations. Recently Ma et. 
al [36] estimated the scaling to be of the order Z 5 ' 47 . 

7.6.2 Evaluating Non-Local Pseudopotentials in QMC 

Excellent references for implementing Non-Local Pseudopotentials (NLPP)s in QMC can be 
found in Refs. [153, 210, 211, 31, 7]. For the sake of simplification the system under consid- 
eration is a single atomic core located at the origin. The results can easily be generalized to 
multiple atomic cores located at the positions {R Q }. The generic form of a non(semi)-local 
pseudopotential included in the valence only Hamiltonian for an atom, from section 7.2, and 
repeated here is 



It is conventional to divide the pseudopotential for an atom into a local part VJ oc , common for 
all angular momentum, and a non-local correction to V\ oc for each angular momentum channel 
Vnl,i- The local part is typically chosen to be the largest angular momentum component, 
and includes the long range —Z e ff/r interaction, where Z e ff is the effective atomic charge. 
The non-local corrections are relatively short ranged, with an extent of only a few bohr radii 




(7.14) 



lm 
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around the atomic center. We now write the NLPP as 



V pp = Y,^oc(^) + \ Y irn)(Vi(r l )-V loc (r i ))(Y lm \] 

i 

= J2\ V ^( r i)+V NL L (7.15) 



The effect of the non-local operator Vnlj on an arbitrary function of r\ is 

*Wfa) = J>(n)*W©i) I d^'i PWW/tt), (7.16) 

l,m J 

where Qi is the angular part of the spherical coordinate r-j and v\ = Vi — Vl oc . The effect 
of the angular momentum integral is to project the different angular momentum channels 
(s,p, d, . . .) out of f(ri) : each channel feels its own effective potential. 
The contribution of Vnl to the local energy E L = V^HVt 

is 

i i 

where the contribution from the zth electron is 

V NLti = $>(r,) £ Y lm (^i) \ dn[ [YUm l T{ ^'''^'''' rN \ (7-18) 

The angular integration is over a sphere centered at the atomic core and passing through 
the ith electron. Eq. (7.18) can be simplified by noting that Y/ m (0,0) = for m ^ 0, and 
orienting the z-axis along r-j to yield 

V NLji = ]>>(n)>l (0,0) / dflj [Y lm (m l T ^--- r }---l N \ , (7.19) 
*Y J *T(ri, ...,ri, ...,r N ) 

here Q r > is the angular part of the vector r' when r points along the z— axis. It is convenient 
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to rewrite Eq. (7.19) in terms of the Legendre polynomials 



Pi [cos 7] 



2/ + 1 



J2lYim(e',4>')}*Yi m (d,4>), 



m=—l 



where 7 is the angle between the vectors r and r', 



Vnl,< = £ 



21+ 1 
Air 



vfc) / dn , i P l [cos(9' i )] 




(7.20) 



• • • j Tj, . . . , rjv) 



The r' dependence of ^ has a primarily angular momentum character of the orbitals in 
the Slater determinant. The Jastrow correlation part of ^>t also has an angular momentum 
dependence and may introduce higher angular momentum components than the determinant 
part. 

In principle, the expression for the energy due to electron % and the atom, Eq. (7.20), needs 
to be summed over all the electrons to give the correct total non-local energy in Eq. (7.15). 
However, since the non-local potentials are short ranged only the electrons within a sphere 
determined by the maximum extent of vi(r) need to be included in the summation. 

The angular integral in Eq. (7.20) must be performed numerically. A quadrature rule on 
a sphere is chosen such that the integration of products of spherical harmonics is exact up 
to a maximum angular momentum l max such that 



for / < l max . In the above equation the N grid points on the sphere are and their 

associated weights {Wj}. 

It is also necessary to perform a random rotation of the grid every time the integral is 
evaluated. The coordinate axes are rotated such that the z— axis has a uniform solid angle 
distribution. This has the effect of ensuring that the estimator of any function evaluated on 
the sphere is unbiased [153]. The rotation of the coordinate system is defined by the Euler 



N 



Y,W i Y^ i )Y Vm {n i )=5 w 5 [ 



(7.21) 



i=i 
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Table 7.2: The spherical grid points and their associated weights: the grid points correspond 
to the vertices of a regular icosahedron (12 points) on a sphere of radius unity. This is the 
grid used to evaluate the non-local pseudopotentials in this thesis, including Ge and Si. 
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angles, which can be found in Ref. [212]. 

For the Ge pseudopotential the appropriate grid is constructed from the vertices of a 
regular icosahedron (12 points) with constant weights [153]. This grid, see Table 7.2, is 
exact for integration of spherical harmonics with / < 5. 



7.6.3 The Consequences of Using Non-Local Pseudopotentials in 
Diffusion Monte Carlo 

The inclusion of a non-local pseudopotential within DMC results in a sign problem analo- 
gous to the fermion sign problem (see Section 4.11.1). The origin of this problem can be 
traced to the imaginary time Schrodinger equation for a Hamiltonian containing a non-local 
pseudopotential, H = T + Vl oc + V^l- The corresponding Greens function in the short time 
approximation has a term of the form (R'| exp(— tVjvz, )|R). The relative sign of this term 
depends on the values of R', R and r. Consequently, the Greens function can no longer be 
interpreted CIS db transition probability density. 

The importance sampling transformation, Eq. (4.64), was made under the assumption 
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that the potential V is a local function. The transformation for the case where V includes 
a non-local part: 



> 2 



hoc i - vtvl — -Et 



$(R, r) 



(7.22) 



can be shown to reduce to 



-MM = -V/(iM) 

at 2 v y 



V ■ [V D (R)/(R,t)] + [E L (R) - Br] f(R,t) 
V nl $CR,t) V nl ^t(R) 



$(R, r) 



/(R,r), 



(7.23) 



where /(R, r) = \&r(R)$(R, r). The last term in curly brackets includes the operation of a 
non-local potential on an unknown wave function and represents a non-local branching. In 
the locality approximation [210] this term is neglected such that Eq. (7.23) becomes identical 
to the original importance sampling transformation derived for local potentials 



■^5^1 = -Iv 2 /(R, t) + V • [V D (R)/(R, t)} + [£x(R) - E T ] /(R, f). (7.24) 



The DMC solution to Eq. (7.24) is the stationary distribution 



lim/(R,r) = tt T (R)$ A (R), (7.25) 

T-+OO 

which is used to determine the mixed estimate of the energy Ea (see Section 4.10.4). The 
wave function <3> A and the corresponding energy Ea are exact in the limit where the trial 
function is equal to the exact ground state eigenfunction $o> since as the term in 

curly brackets approaches zero. An unfortunate consequence of the locality approximation 
is that the energy Ea is no longer variational; Ea is not an upper-bound to the exact energy 
Eq, but is lower than the variational energy. 
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Table 7.3: Test results for VMC calculations of atoms and molecules featuring pseudopo- 
tentials. The trial wave function for VMC is identical to the HF wave function and does 
not include a Jastrow correlation factor. All HF results are from the Gaussian03 package 
[11]. The Ge pseudopotential from Ref. [12] is non-relativistic; the atomic basis is the uncon- 
tracted set 6s6pld. The C pseudopotential and basis are from Refs. [13, 14]. The methane 
molecule uses the same C pseudopotential and basis as the C atom. The H potential is the 
bare — 1/r and the H basis set Ylslp is contracted to 3slp. The C-H bond length is 2.06738 
Bohr. The second column is the number of core electrons replaced by the pseudopotential 
(Ncore) and the third column is the total number valence electrons (Ny a / ence ). The reason 
why the error bar for the Ge calculation is smaller than the calculations involving C is that 
the Ge pseudopotential has l max = 2, while C has l max = 3. All energies in a.u. 
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-5.32233 
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-3.628939(89) 


CH 4 


2 
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3 


-7.83536 


-7.83526(73) 



7.7 Core Polarization Potentials 

The physics of the core polarization potential (CPP) can be traced to valence electrons 
inducing a core polarization and feeling the induced potential. The energy of a core-dipole 
induced by an electric field acting on the core is added to the valence Hamiltonian H va [. 
In a point dipole picture, the electric field acting on core J can be written in terms of the 
positions of the cores {Rj} and the valence electrons {r^}: 

^ = E T ^-Y:w Lz ' = ^+< < 7 - 26 > 

where rj, = — Rj| is the electron-core distance and Rjji = |Rj/ — Rj| is the core-core 
distance. The full CPP is given by 



Hcpp = --^a/Ej-Ej, (7.27) 

j 
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where aj is the static polarizability of core J. Inserting Eq. (7.26) into Eq. (7.27) yields 



n 



CPP 



r ji ■ rjj 



R 3 

i J T 



Ji- LL JJ 



R 3 



n 



CPP 



\ E ^ [ V e + V e-e + V e-I + Vf_ t ] . 



(7.28) 



(7.29) 



The first two terms represent a single and two-electron interaction, respectively, and will be 
discussed in more detail. The third term reflects the indirect interaction of a single valence 
electron and another core. For a valence electron close to core J',rji m Rjj', therefore the 
third term is repulsive in character [40] . Finally, the last term is a pure core-core interaction 
term and is a constant. 

The core dipoles are not point dipoles so appropriate corrections must be made when 
valence electrons penetrate the cores. The electric field acting on core J due to the valence 
electrons, E} in Eq. (7.26), is truncated at close proximity to the core center by a cutoff 
function. Shirley and Martin implemented the function introduced by Miiller, Flesch and 
Meyer [40] in their study of alkali and alkaline earth atoms: 



f(r) 



1 - e 



-(r/f) 2 



(7.30) 



where f is an adjustable parameter. This function approaches zero at the origin and has no 
effect at long range. 

The form of the one-electron term used by SM has an explicit angular momentum de- 
pendence 

(7.31) 



where I is the angular momentum and P l is the projector of the /th angular momentum of 
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electron i with respect to the Jth ion core. At long range this term has the form — a/2r 4 , 
as required by Born-Heisenberg result [213] for the interaction of an electron and a dipole of 
polarizability a. The parameters {f l j} are fit for the s, p and d channels by forcing V l e and 
the self-energy, calculated in a generalized GW approximation, to have the same expectation 
values for the lowest valence states of each angular momentum in bare-core configurations. 
For one electron outside a core the valence properties are determined by the one-electron 
Green's function, which is a solution to Dyson's equation. All core-valence interactions 
beyond the HF level are incorporated into the self-energy: the self energy corrects Koopman's 
eigenvalues. 

The two electron term has an analogous form 

Ve-e = ~£ a 'E (-) f (^) . (7-32) 

2 j j£ \rjj \rj J 

where f is assigned the average of the s and p channels 

f = (7.33) 

for s — p bonding systems. This term screens the dipole exchange [43] and reduces the 
valence-valence correlation effects because correlated electrons tend to be on opposite sides 
of a given core due to their Coulomb repulsion while the factor rj.; ■ r jj discourages such 
correlation. 

7.7.1 Core Polarization Potential for Germanium 

The evaluation of the SM CPP can be greatly simplified by making the V e term local. This is 
justified for Ge since the cutoff radii {f 1 } for the s, p and d channels are approximately equal, 
see Table 1 in Ref. [43] for the CPP parameters for use in relativistic HF pseudopotentials. 
For Ge we use the core polarizability a = 0.7732 and a single cutoff radius, f = 0.9063, which 
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is the angular momentum weighted average of r° and r l . The CPP can now be evaluated by 
inserting the effective electric field 




(7.34) 



into Eq. (7.27) 2 . The inclusion of the CPP within QMC has little effect on the computation 
time since most of the required quantities are already necessary to evaluate the trial wave 
function and the electron- ion and electron-electron interaction energies. Figure 7.2 plots the 
V e term of the CPP. 




0.5 1 1.5 2 2.5 3 

r(a.u.) 

Figure 7.2: The V e term of Hcpp (see Eq. (7.31)) for the Ge atom. 



7.7.2 Core Polarization Potentials in QMC: Previous Work 

CPPs have successfully been included in QMC by several authors. The first known use of 
a CPP in QMC was by Shirley et al. [44] for calculations of the ionization potential and 
2 A more general relation for the case when Ticpp is semi- local can be found in Rcf. [45]. 
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electron affinity of the Na atom and the bond length and disassociation energy of the Na2 
dimer. Maezono et al. [45] also studied Na, replacing the Na + core with a relativistic HF 
pseudopotential and modeled the core-valence correlations by the SM CPP. The inclusion 
of the CPP greatly improved the ionization energy of the atom, while there was marginal 
improvement for the cohesive energy of the solid. For the periodic system the electric field 
was evaluated as an analytic derivative of the Ewald potential. Alfe et al. [46] estimated that 
the inclusion of the CPP reduced the pressure induced phase transition from the diamond 
to (3— tin structure of Si from 17.7 GPa to 16.5 GPa. Lee and Needs [47] developed their 
own parameterized CPPs for Si and Ti that use a different cutoff function f(r) and are 
angular momentum independent, in contrast to the SM CPPs. This work was motivated 
by an earlier study of Ti that found the SM CPP is much too attractive in the d-channel, 
and leads to worse results in valence correlated wave function calculations than if the CPP 
was neglected [207]. The CPP parameters were fit such that the inclusion of the CPP to a 
relativistic HF pseudopotential reproduced the core-valence correlation energies calculated 
using a multiconfiguration HF (MCHF) method. Only double substitutions from the HF 
reference state are included, consisting of one core and one valence substitution. Systematic 
comparisons between the Lee and Needs CPP [47] and SM CPP for atomic excitations 
found that the SM CPP performed slightly better in Si, while in Ti the Lee and Needs CPP 
worked much better. Finally, in a study that is directly relevant to this work, Porter et al. [26] 
included the CPP in DMC calculations of the optical properties of the Si n H m nanoclusters, 
for (n + m) < 34. 
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Chapter 8 



Results for Atomic Systems: Effect of 
the CPP on Atomic Excitations of Ge 

8.1 Introduction 

Quantifying the effect of core- valence correlation on atomic excitations is a natural starting 
point for examining the role of core- valence correlation on the excitations of larger Ge struc- 
tures, including the Ge n H m molecules and nanoclusters and even bulk Ge. Core-valence 
correlation is incorporated by the SM CPP which for atoms includes the single-electron V e 
and electron-electron V e - e interactions in Eq. (7.29). These calculations are also useful in 
determining what role, if any, the relative occupation of the s and p atomic orbitals has on 
the size of the core-valence correlation energy. 

Farid and Needs [214] have performed calculations for the first four ionization energies 
of Ge and calculated the total energy of the pseudoatom. The purpose of these calculations 
was to compare the quality of two norm-conserving relativistic LSDA pseudopotentials, one 
including non-linear core-corrections and the other one not, to all-electron and experimental 
results. The quality of the pseudopotential results was found to be much poorer for Ge than 
for Al, Si and C; for the other elements in the study the pseudopotentials including the 
non-linear core corrections were found to produce more accurate results. It should be noted 
that older experimental values, which can be found in Ref. [15], are used for comparison in 
this paper. 

Shirley [17, 134] has performed HF and GW calculations on Ge; the parameterized CPP 
is fit to GW results for a single valence electron of each angular momentum channel outside 
a bare core. 
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8.2 Computational Method 

The Quantum Monte Carlo (QMC) results are from the Variational Monte Carlo (VMC) 
method and the Diffusion Monte Carlo (DMC) method within the fixed-node approximation 
(FN-DMC) as discussed in Chapter 4. VMC explicitly includes correlation by using a many- 
body trial wave function \P<r- Fixed-node DMC is a stochastic method that uses the imaginary 
time propagator e~ rH to project the lowest energy state $o consistent with the nodal-surface 
of a trial wave function \DV- The non-local pseudopotential is implemented within DMC by 
the locality approximation [210]. All QMC results are from the qmcPACK package [3]. 

The form of the trial wave function is the product of Slater determinants and Jastrow 
correlation factors: 



tf T (R) = D T ( ri , . . . , r N ^)D l (r Nl+1 , ...,r N ) exp 



(8.1) 



The Slater determinants for the spin-up electrons and spin-down electrons are con- 
structed from single particle orbitals obtained from HF. The Jastrow electron-electron cor- 
relation factor u is spin-independent and has the form 

u(rij) = arij/(l + br i:j ), (8.2) 

where r^- = |r, — Tj\ is the electron-electron distance. The parameter a = 0.5 to satisfy 
the cusp condition for opposite spin electrons and b is adjustable. The cuspless electron-ion 
Jastrow factor has two adjustable parameters and is of the form 

Xi(ru)=a I /{l + b I r%), (8.3) 

where / is the ion index and rn = jr, — Rj| is the electron-ion distance. All of the Jastrow 
parameters are optimized within the VMC correlated sampling framework [155] by mini- 
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mizing a combination of the local energy and the variance of the local energy. Since the 
Jastrow terms are positive and multiplicative, the nodes of ty? are determined by the Slater 
determinants and hence the accuracy of the FN-DMC results via the fixed-node error. 

The Ge pseudopotential is discussed in Section 7.4, a CPP is added to this HF pseudopo- 
tential to account for core-valence correlations. 

The Gaussian basis to describe the valence electrons of a Ge atom is the uncontracted 
set 5s5pld [215]. The Gaussian exponents of this basis are optimized for each atomic con- 
figuration within HF. The quality of this basis is verified by the fact that the energy of the 
ground state 4s 2 4p 2 ( 3 P) configuration is within a mHartree of the basis set limit [209], see 
Table 8.1. HF total energy calculations performed by the Gaussian03 [11] package for each 
of the atomic configurations provide the single particle orbitals for the Slater determinants 
< 
] 




Figure 8.1: The basis for the ground state ( 3 P) of the Ge atom. 
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Table 8.1: Comparison of the HF results for the 5s5pld basis to the basis set limit (infinite 
basis) for the ( 3 P) ground-state of the Ge atom. All energies in a.u. 



Basis 


5s5pld 


Infinite-Ref. [209] 


Total Energy 


-3.66625 


-3.66673 



Table 8.2: The CPP estimator (Hcpp) of various atomic configurations of Ge. All energies 
in eV. 



No. of electrons Configuration 


(Hcpp) 
VMC DMC 


4 As 2 Ap 2 ( 3 P) 
AsAp 3 ( 5 S) 


-1.74153(21) -1.76550(68) 
-1.35562(26) -1.37333(49) 


3 AsHp( 2 P) 
4s4p 2 ( 4 P) 


-1.68520(30) -1.73214(68) 
-1.34206(26) -1.31730(49) 


2 4s 2 { 1 S) 


-1.73938(30) -1.75481(87) 


1 4s( 2 S) 


-1.09059(14) -1.10514(41) 



8.3 Results for Excitations 

Table 8.2 shows the results for the CPP estimator (Hcpp) from VMC and DMC for several 
atomic configurations. The CPP estimator serves as a measure of the core- valence correlation 
energy. In VMC (Hcpp) is evaluated by averaging the quantity ^^Hcpp^t with respect 
to the distribution |\I/t| 2 , where \I/t is the trial wave function. In DMC the averaging is 
performed with respect to the mixed distribution ^t^O; where $o is the lowest energy state 
consistent with the nodes of \1/t- The magnitude of (Hcpp) are not noticeably different in 
VMC and DMC. In general, (Hcpp) is largest for the configurations which have two electrons 
in a 4s— orbital, indicating an angular momentum dependence. The 4s 2 4p 2 ( 3 P) ground-state 
and the As4p 3 ( 5 S) excited state configurations in Table 8.2 both contain four electrons, but 
the magnitude of (Hcpp) is larger for 4s 2 4p 2 ( 3 P) by more than 25%. Consequentially, it can 
be expected that the size of the CPP correction for atomic excitations depends on the initial 
and final occupation of the s— orbitals. 

Table 8.3 gives the QMC results, with and without the CPP, for several excitations of 
the Ge atom; including the s removal energy, the s —* p promotion energy and the first 
four ionization potentials. We have also included the total energy of the pseudoatom with 
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and without the CPP. All excitation energies are evaluated as total energy differences; see 
Table 8.4 for the QMC results for the total energies. The CPP correction is calculated as the 
difference of the VMC (DMC) and VMC+CPP (DMC+CPP) results, where VMC+CPP 
(DMC+CPP) indicates inclusion of the CPP. The CPP correction can also be evaluated as 
the difference of the CPP estimators for the ground and excited states: (TCcpp) — {'Hcpp)- 
As will be shown in the next section, these two methods of calculating the CPP correction 
are equivalent in VMC and are in very close agreement in DMC. The inclusion of the CPP 
increases the excitation energies and significantly improves both the VMC and DMC results 
relative to experiment. The best values being the DMC+CPP ones. Interestingly, the 
VMC+CPP results are in closer agreement to experiment than the DMC results for most of 
the excitations. From this we can conclude that the gain in energy due to the inclusion of 
core-valence correlations via the CPP are larger than the gain in energy due to the improved 
treatment of valence-valence correlations by DMC. As expected from Table 8.2, the CPP 
correction is largest for the excitations that involve removing or promoting an electron in 
a s— orbital, such as the s removal energy and s — > p promotion energy. In contrast, the 
CPP correction is almost negligible for the first two ionization energies, which correspond 
to removing electrons in p— orbitals. 

Previous QMC studies of Ge [49, 50, 51] have reported results for the total energy of the 
ground-state of the Ge pseudoatom. These calculations feature a local [52] LDA pseudopo- 
tential and use single-particle orbitals from LDA to construct the Slater determinant part 
of the trial wave functions. The best VMC result is -103.22(1) eV and the best DMC result 
is -103.42(3) eV [51]. 

Table 8.3 also includes HF results. This is mainly because we used the HF orbitals to 
construct the Slater determinants and to demonstrate how significantly QMC improves upon 
the HF results by treating the valence- valence interactions at a many-body level. It should be 
remembered that HF completely omits correlation and therefore tends to underbind atoms 
and the valence levels. This means that both the total energy and the valence eigenvalues 
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are not sufficiently negative [205]. In addition, GW results from Shirley [17] are included 
where available. 
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Table 8.3: For the Ge atom results for the s — > p promotion energy, the s removal energy (s~), 
the first four ionization energies and the total energy of the pseudoatom, with and without 
the CPP. All excitation energies are total energy differences. The CPP correction, ACPP, is 
calculated as the energy difference of the results with and without the CPP. Experimental 
results from Ref. [15] have been properly spin-orbit corrected [16]: the J weighted averaging 
has been performed with respect to the 4s 2 ( 1 S') configuration, see Appendix A. 10. All 
energies in eV and the QMC error bars are smaller 0.01 eV. GW results from Ref. [17]. 





s — ► p 


s 




s 2 p 2 ( 3 P) -> 


s 2 p 2 ( 3 P) -> 




sp 3 ( 5 S) 


sp 2 ( A P) 


Expt. 


5.08 


14.31 


DMC+CPP 


5.12 


14.37 


DMC 


4.74 


13.94 


ACPP 


0.38 


0.43 


VMC+CPP 


4.90 


14.24 


VMC 


4.51 


13.84 


ACPP 


0.38 


0.40 


GW 


4.3 


13.5 


HF 


3.48 


12.28 





1st IP 


2nd IP 


3rd IP 


4th IP 


Total 




s 2 p 2 ( 3 P) -> 


s 2 p( 2 P) -> 


s 2 ( 1 S) -> 


s( 2 S) - 


Energy of 




s 2 p( 2 P) 


s 2 ( 1 S) 


s( 2 S) 


[Ar^YS) 


s 2 p 2 ( 3 P) 


Expt. 


7.93 


15.79 


34.06 


45.72 


103.50 


DMC+CPP 


7.86 


15.68 


34.06 


46.00 


103.60 


DMC 


7.85 


15.66 


33.46 


44.90 


101.86 


ACPP 


0.02 


0.01 


0.60 


1.11 


1.74 


VMC+CPP 


7.89 


15.43 


34.02 


45.98 


103.31 


VMC 


7.83 


15.48 


33.38 


44.89 


101.57 


ACPP 


0.05 


-0.05 


0.65 


1.09 


1.74 


HF 


7.41 


15.27 


32.29 


44.89 


99.85 
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HF 



VMC 



VMC+CPP 



DMC 



DMC+CPP 



(4s) 2 (4p) 2 
(4s) (4p) 3 
(4s) 2 (4p) 
(4 S )(4p) 2 

(4s) 



-99.85330 -101.57461(54) -103.31447(54) -101.8640(15) -103.6005(16) 



-96.37805 
-92.44232 
-87.57443 
-77.17685 
-44.88760 



-97.06045(49) 
-93.73979(79) 
-87.73108(71) 
-78.26320(57) 
-44.88768(33) 



-98.41686(49) -97.11961(65) -98.48026(73) 

-95.42760(84) -94.0159(16) -95.7371(17) 

-89.07211(68) -87.9200(11) -89.2270(12) 

-80.00258(54) -78.35362(95) -80.06203(90) 

-45.97826(35) -44.89895(38) -46.00457(41) 



Table 8.4: Hartree-Fock (HF) and QMC total energies for various atomic configurations of 
Ge. All energies in eV. 
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Chapter 9 

Results for Molecular Systems 



9.1 Introduction 

The importance of core- valence correlation for the atomic excitations of Ge provides a moti- 
vation to study the effect of the core polarization (CPP) on larger Ge structures. The next 
step in this thesis is to examine and quantify the effect of the CPP on the excitations of 
the closed-shell molecules GeH4 and Ge 2 H6- These molecules represent the molecular limit 
of the hydrogen passivated structures which have been the subject of several theoretical 
studies [58, 59, 60, 61, 62, 63, 64, 65, 1, 28]. The results for GeH 4 and Ge 2 H 6 provide a 
benchmark for predicting the effect of the CPP on the larger structures GesHi 2 , GeioHi6, 
Gei 4 H 20 and Ge 2 9H 36 , which are discussed in the following chapter. One purpose of the 
present calculations are to demonstrate that the CPP correction can be calculated from the 
CPP estimators instead of energy differences; this is useful for the larger clusters where it 
leads to a considerable saving in computer time. In addition, the results for excitation spec- 
tra can be compared with experiment and with corresponding results for SiH4 and Si 2 Hg, 
which is useful for the analysis of the larger clusters given in the following chapter. 

9.2 Computational Method 

The Gaussian basis for the Ge atoms consist of the uncontracted set AsApld. This basis is 
nearly identical to the basis used for the atomic calculations (Section 8.2), with the exception 
that we removed a single diffuse sp shell. The H atomic basis is the primitive set 21slp which 
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is contracted to 3slp. The 10 Gaussians which are contracted to form the Is and 2s orbitals 
of H are fit to Slater type orbitals. This serves to better satisfy the cusp condition at the 
positions of the H nuclei, and has been found to significantly reduce the variance of the local 
energy El = ^T^Ti^T- For the Jastrow part of the trial wave function we used electron- 
electron and electron-nuclear correlations. The Jastrow part of contains a total of five 
adjustable parameters. All DMC calculations use a time-step of 0.05 a.u. 

The discussion of the Ge pseudopotential and CPP found in Chapter 7. The H nuclei 
are represented by a — 1/r bare-core potential. 

The optical excitations are calculated in QMC following the method in Section 4.13. 
Only the ground state trial wave function is optimized; optimization of the excited state 
wave function was found to have an insignificant effect on the optical gap. 

For the analysis of the effect of the CPP on the properties of GeH 4 and Ge 2 H 6 , we chose 
to use the best available experimental data for the geometries. For the GeFJ-4 molecule, we 
use the bond length 1.520 A from Ref. [216] and references therein. For Ge2Hg, we use the 
Ge-Ge bond length 2.403 A, the Ge-H bond length 1.541 A and Ge-Ge-H bond angle 112.5° 
[217], see Figure 9.2. 

9.3 Results for Atomization Energies 

We have obtained the atomization energies of 12.725(5) eV for GeH_4 and 20.812(6) eV 
for Ge2H6 in DMC with the addition of the CPP. The agreement with both experiment 
and other theoretical values is reasonable. Our DMC+CPP result for GeH 4 is close to the 
experimental value of 12.490(37) eV; this value has been extrapolated to K and corrected 
for the zero-point energy and spin-orbit averaged for the Ge atoms [218]. For both GeH 4 
and Ge2H 6 we can compare directly with the atomization energies calculated previously by 
Ricca and Bauschlicher [219] using coupled-cluster singles and doubles with perturbative 
triples [CCSD(T)] methods which were extrapolated to attempt to reach the basis set limit. 
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r (au) 



(a) The atomic Ge basis. 



4 




r (a.u.) 



(b) The atomic H basis. 

Figure 9.1: The basis sets for the Ge atoms 9.1(a) and the H atoms 9.1(b). For H the linear 
combination of 10 Gaussians used to construct the Is and 2s orbitals are each fit to a single 
STO. 
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(a) GeH 4 (b) Ge 2 H 6 

Figure 9.2: The GeH 4 and Ge 2 H 6 molecules. 

Using the same relaxed geometries found by those authors, our DMC+CPP atomization 
energies of 12.719(5) eV for GeH4 and 20.800(6) eV for Ge2Hg are very close to the values 
of 12.78 eV and 20.99 eV found by Ricca and Bauschlicher. 1 Earlier work by Nicklass and 
Stoll [195] differs by ~ 0.8 eV, which may be attributed to basis set convergence issues and 
differences in the geometries. Nonetheless, our work and that of Nicklass and Stoll conclude 
that core-valence correlation has only a small effect for the atomization energies since the 
magnitude of the core-valence interaction energy is roughly the same size for the molecule 
and the constituent atoms. 

9.4 Treatment of the CPP: Estimating the CPP 
Correction to Excitations 

One purpose of the calculations on the molecules is to show that it is possible can extract the 
CPP correction, ACPP, for excited state properties without having to perform two sets of 
1 All comparisons are to the results from Table 5, excluding the spin-orbit and zero-point energy terms. 
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calculations, i.e. with and without the presence of the CPP in the valence Hamiltonian 7i va i- 
This is especially important for the larger Ge n H m nanoclusters which involve considerably 
more computation time than the GeH4 and Ge2Hg molecules to attain the same level of 
accuracy in the optical gap. Ideally, the CPP correction for excitations can be calculated as 
the difference of the CPP estimators of the ground and excited state 

ACPP « A(H CPP ) = \(Hcpp) ex - (Hcpp) gs \. (9.1) 

Hence, it is necessary to determine the accuracy of the CPP estimator in both the VMC 
and DMC methods. In VMC, the CPP estimator is equal to the difference of the energies 
obtained with and without the CPP 

(H-cpp) = (H-vai+cpp) — (Hval), (9-2) 

since all of the averages ( ) are computed with respect to the probability distribution ^>\. 
However, in the DMC method the estimators are computed with respect to the mixed dis- 
tribution / = ^l/y^o- The inclusion of the CPP in the valence Hamiltonian has the effect 
of changing the DMC solution: $ h_ * ^q PP - Therefore, Eq. (9.2) is only an approximate 
relation in DMC, (H V ai+cpp) — (Hcpp) ~ (Hvai)- The terms on the left hand side of the 
equation are averaged with respect to the distribution ^/t^o PP , while the term on the right 
hand side is averaged with respect to \1/t$ - 

Table 9.1 presents the comparison between the CPP estimator (Hcpp), and the energy 
difference (AE) = (Ti-vai+cpp) — (Hvai), for the the total energy of the ground state and an 
excited state of the Ge2H 6 molecule. (The excited state is created by promoting an electron 
from the highest occupied molecular orbital (HOMO) to the lowest unoccupied molecular 
orbital (LUMO) in the spin- up Slater determinant). The VMC results are within an error 
bar of each other, while the DMC results are close enough to justify replacing AE with 
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(Hcpp). Table 9.1 also presents results for the accuracy of the CPP estimator method for 
the lowest singlet transition, which is the difference between the excited state energy and 
the ground state energy. For this case the comparison is between the difference of the CPP 
estimators of the excited state and ground state, see Eq. (9.1), and the difference of the 
energy of the singlet transition with and without the CPP in the valence Hamiltonian. Once 
again, the VMC results are within an error bar of each other, while the DMC results are 
within approximately 0.05 eV. Therefore, it can be concluded that it is possible to determine 
the CPP correction to excited state properties by subtracting the CPP estimators for the 
ground and excited states. All subsequent mentions to the CPP correction will be calculated 
by Eq. (9.1). Additionally, from Table 9.1 it can be seen that the magnitude of the CPP is 
larger for the excited states than for the ground state, indicating that the CPP should lower 
the energy of the optical excitations of Ge 2 H 6 . 

Table 9.1: The effect of the CPP (eV) on the ground and excited state energies of Ge2H 6 
calculated by two methods: (i) the differences in the QMC energies with and without the 
CPP (AE) = (H V ai+cpp) ~ (Hvai) an d the CPP estimator (H C pp). The excited state is 
constructed by promoting a spin-up electron from the HOMO and placing it in the LUMO. 
Results are also presented for the effect of the CPP on the energy difference between the 
excited state and ground state, which is the singlet transition. 



CPP 


Ground State (GS) 


Excited State (EX) 


EX - GS 


VMC ( Hcpp } 
VML (AE) 


-3.1976(21) 
-3.203(17) 


-3.4191(22) 
-3.412(18) 


-0.2215(30) 
-0.209(25) 


DMC ( Hcpp } 
DML (AE) 


-3.3654(14) 
-3.471(10) 


-3.5849(14) 
-3.638(22) 


-0.2195(20) 
-0.167(24) 



9.5 Results for Excitations 

The main interest is determining the effect of core- valence correlation on the singlet transi- 
tions of GeH4 and Ge2Hg, where the optical gap is the lowest dipole allowed singlet transition. 
Table 9.2 gives the VMC+CPP and DMC+CPP transition energies, as well as the change of 
energy due to the inclusion of the CPP, for the first two singlet transitions of GeH 4 , and the 
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first three singlet transitions of Ge2Hg. The Ge2Hg molecule provides a better example for 
studying the effect of core-valence correlation since it is the first of the Ge n H m structures to 
include a Ge-Ge bond. Each of these optical transitions involves removing an electron from 
the HOMO - t 2 f° r GeH 4 and A\ g for Ge2H 6 - and promoting it to an excited state molecular 
orbital, which are ordered by their single particle energies. The QMC gaps are considerably 
lower than the HF gaps, with the DMC+CPP gaps being the lowest. The inclusion of the 
CPP has the effect of lowering the transition energies. 

Table 9.2: The QMC results (eV) for the first two singlet transitions of GeH4 and the 
first three singlet transitions of Ge2H 6 , with the corresponding CPP corrections ACPP. 
The transition energies are calculated by Eq. (4.96), where the excited state is created by 
promoting an electron from the HOMO to the excited state orbital denoted by its irreducible 
representation. The transition corresponding to the optical gap is in bold. We have used 
the experimental geometries for GeH4 and Ge2Hg. 





GeH 4 [T d ] 
HOMO t 2 


Ge 2 H 6 [D 3d ] 
HOMO A lg 


Excited State 


a 1 t 2 


E U A\g 


HF 


17.719 18.372 


15.006 15.155 15.786 


VMC+CPP 
ACPP 


10.266(12) 10.576(12) 
-0.2488(20) -0.0944(20) 


7.976(18) 8.088(17) 9.146(19) 
-0.2215(30) -0.0185(44) -0.2242(33) 


DMC+CPP 
ACPP 


9.479(13) 10.110(11) 
-0.2221(18) -0.0732(17) 


7.541(12) 7.922(11) 8.169(15) 
-0.2195(20) -0.0154(19) -0.1994(19) 



The size of the CPP correction strongly depends on the symmetry of the excited state. 
For GeH 4 , the size of the CPP correction is much larger for the t 2 — > a\ transition than the 
optically forbidden t 2 — > t 2 transition. This is also the case for Ge2Hg : the CPP correction 
is largest for the A\ g —* A 2u and A\ g — > A\ g transitions, while it is almost negligible for 
the A\ g — ■> E u transition. The difference in the CPP corrections can be explained by the 
relative s to p Ge atomic orbital character of the excited state molecular orbitals. For GeH4, 
the ai molecular orbital has a s character of the Ge atomic orbitals while the t 2 has a p 
character. For Ge2Hg, the Ai g and A 2u molecular orbitals have a s character of the Ge atomic 
orbitals, while the E u molecular orbital has a character of the Ge p atomic orbitals. As was 
demonstrated for the Ge atom in Chapter 8, the size of the core- valence correlation energy is 
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(a) HOMO - t 2 



(b) LUMO - a t 




(c) 2nd LUMO - t 2 

Figure 9.3: Isosurfaces for several molecular orbitals of GeH4, including the HOMO and the 
first two excited orbitals (red +, blue -). The MO is triply degenerate. The irreducible 
representation is also denoted. 
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(a) HOMO - A lg 



(b) LUMO - A 2 . 




(c) 2nd LUMO - E u (d) 3rd LUMO - A Xg 

Figure 9.4: Isosurfaces for several molecular orbitals of Ge2H 6 , including the HOMO and the 
first three excited orbitals (red +, blue -). The E n MO is doubly degenerate. The irreducible 
representation is also denoted. 
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different for different angular momentum channels, and is a larger for electrons in s-orbitals 
than in p-orbitals. Figures 9.3 and 9.4 plot the isosurfaces of the molecular orbitals used to 
evaluate the transition energies in Table 9.2 for GeH 4 and Ge2Hg: 2 from the isosurfaces 
it is relatively easy to visualize the relative Ge atomic s and p character of the molecular 
orbital. Table 9.3 provides the total energies of the ground and excited states of GeH 4 and 
Ge2H6 used in constructing Table 9.2, while Table 9.4 provides the CPP estimators. 

Table 9.3: Hartree-Fock (HF) and QMC total energies for the ground-state E GS and excited 
states E EX of GeH4 and Ge2Hg: the subscript on E EX denotes the unoccupied molecular 
orbital to which the spin-up electron is promoted. The best available experimental geometries 
are used. All energies in eV. 



(a) GcH 4 





HF 


VMC 


VMC+CPP 


DMC 


DMC+CPP 


E GS 

rpEX 
^LUMO 
rpEX 

rj 2ndLUMO 


-163.8638 
-146.1445 
-145.4920 


-167.9656(73) 
-157.4558(93) 


-169.5953(73) 
-159.3293(90) 
-159.0196(93) 


-169.0029(71) 
-159.369(12) 


-170.7480(82) 

-161.269(11) 

-160.6384(82) 


(b) Ge 2 H 6 




HF 


VMC 


VMC+CPP 


DMC 


DMC+CPP 


E as 

rpEX 
^LUMO 
rpEX 

rj 2ndLUMO 
rpEX 

^ZrdLUMO 


-296.7392 
-281.7332 
-281.5838 
-280.9534 


-304.485(11) 
-296.300(11) 


-307.688(12) 
-299.712(13) 
-299.600(12) 
-298.542(14) 


-306.1756(68) 
-298.468(20) 


-309.6470(76) 
-302.1064(90) 
-301.7249(76) 
-301.481(13) 



9.6 Comparison to Experiment 

The results for GeH4 can be compared with experiment and with previous QMC calculations 
for SiH 4 [48, 26, 57]. Because there is controversy and different interpretations of the experi- 
mental results, we will use the previous theoretical work on SiH4 to make direct comparisons 
of the results and to aid in the comparison to experiment. 

The absorption spectrum of GeH 4 has been reported to display broad peaks at 8.3 eV, 
9.4 eV and 10.4 eV [19, 221, 222], although there is some disagreement about the location of 
2 All isosurfaces made using the XCrySDen package [220]. 
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i ' 1 p 1 r 




Wavelength (nm) 



(a) The experimental absorption cross section for SiH4 and GeH4 
from Itoh et al. [19]. The proposed Rydberg transitions are also 
indicated. The peaks for SiH4 are located at 8.8 eV, 9.7 eV and 
10.7 eV; GeH 4 the peaks are at 8.3 eV, 9.4 eV and 10.4 eV. 




KINETIC ENERGY 
1 1 ■ ■ I I ■ I . I I I I . . t I > ■ . I I ■ I I [ I ■ ■ ■ I . I ■ * 1 I . 

I4.S 14.0 (3.5 13.0 123 11.0 H.S (t.O 



(b) The photoelectron spectrum of GeH4 from Pullen et al. [18]. 
The peaks are located at 11.98 and 12.46 eV. The vertical scale is 
the counts/second. 
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Table 9.4: The CPP estimator (Hcpp) for the ground-state and excited states of GeH4 
and Ge2H6. The best available experimental geometries are used. The excited state is 
constructed by promoting a spin-up electron from the HOMO and placing it in the MO in 
parenthesis. All energies in eV. 



(a) GeH 4 




VMC DMC 


E GS 

rpEX 
^LUMO 
rpEX 

rj 2ndLUMO 


-1.6273(12) -1.6985(14) 
-1.8761(16) -1.9206(11) 
-1.7217(16) -1.7718(9) 


(b) Gc 2 H 6 




VMC DMC 


E GB 

rpEX 
^LUMO 
rpEX 

rj 2ndLUMO 
rpEX 

^ZrdLUMO 


-3.1976(21) -3.3654(14) 
-3.4191(22) -3.5849(14) 
-3.2161(24) -3.3808(13) 
-3.4219(26) -3.5648(13) 



the second peak [221]. The first peak has been assigned as a £2 —> cti(5s) Rydberg transition, 
but there is disagreement in the assignment of the other peaks [19, 221]. The ionization 
energy is determined by the photoelectron spectrum, which displays two distinguishable 
peaks at 11.98 eV and 12.46 eV [18]. The same features are found in SiH 4 . The photoelectron 
spectrum of SiH 4 displays the same double peak structure as GeH 4 , with peaks at 12.36 eV 
and 12.85 eV [18]. The absorption spectrum of SiH 4 also displays a nearly identical structure 
to that of GeH 4 , with the peaks shifted to higher energies at 8.8 eV, 9.7 eV and 10.7 eV. 
Thus the overall conclusion is that both the ionization potential and the gaps are 0.3-0.5 eV 
higher in SiH 4 than in GeH 4 . 

The DMC+CPP results for the ionization energy and the optical gap of GeH 4 are shown 
in Table 9.5 for both the experimental bond length (the same is in Table 9.2) and the calcu- 
lated LDA bond length of 1.571 A. The previous results for SiH 4 that are most comparable 
to our work are those of Porter et al. [48] who carried out a DMC+CPP calculation using 
the LDA bond length for SiH 4 . Their DMC+CPP result of 9.47(2) eV for the optical gap 
can be compared directly with the present result of 8.93(1) eV, i.e. a decrease in the gap 
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of 0.5 eV in going from SiH 4 to GeH4. This is consistent with the difference in experiment 
between SiH 4 and GeH 4 . 

A full analysis of the optical structure and the experimental results requires an inter- 
pretation of the multiple peaks seen in experiment. Porter et al. [48] maintained, support- 
ing earlier all-electron multi-reference single- and double-excitation configuration interaction 
(MRD-CI) work by Chantranupong et al. [223], that the first two peaks of the SiH4 spec- 
trum actually derive from a Jahn- Teller splitting of the triply degenerate t 2 — > 4s transition, 
similar to the splitting in the photoelectron spectrum for the ionization energy [18]. 3 (The 
excited state involves partially filled degenerate t 2 HOMO levels, which is subject to Jahn- 
Teller distortion that can break the symmetry and lift the degeneracy.) Thus the single 
calculated gap should be compared to the two experimental peaks, i. e. lie in between the two 
peaks, which is consistent with the calculated gaps for SiH 4 and GeH 4 . This interpretation 
is different from Grossman et al. [57], whose result of 9.1(1) eV is closer to the first peak 
in the absorption spectrum. It is important to note that spin-orbit effects, which we have 
neglected, are expected to be considerably larger for GeH 4 than for SiH 4 . 

Table 9.5: DMC+CPP results (eV) compared to experiment for the ionization potential 
(IP) and optical gap (E"^) of GeH 4 in its experimental and LDA relaxed geometry. The 
IP is calculated as the total energy difference of the N and N — 1 electron systems: IP = 
E[N] — E[N — 1]. The optical gap is calculated by Eq. 4.96. The experimental data for the 
IP are from Ref. [18] and the optical gap from [19]. See the text for a discussion of the 
comparison of our single energy with the two peaks observed in experiment. 







Ge-H bond length: 








Expt. LDA 


Expt. 


IP 


DMC+CPP 
ACPP 


12.577(12) 12.4963(69) 
-0.0456(19) -0.0820(17) 


11.98,12.46 


rpopt 

gap 


DMC+CPP 
ACPP 


9.479(13) 8.934(13) 
-0.2221(18) -0.2272(21) 


8.3,9.4 



3 Several theoretical investigations have attempted to find the most stable structure of the GCH4+ ionized 
molecule, although there is a disagreement about which is the correct structure [224, 225]. This is especially 
useful in determining the adiabatic ionization energy. 
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Chapter 10 

Results for Nanoclusters 



10.1 Introduction 

In this chapter the optical gaps of the molecules and nanoclusters GeH4, Ge2H6, GesH^, 
Gei H 16 , Gei 4 H 2 o and Ge 2 9H 36 are presented (see Figure 10.11). The CPP approach outlined 
in Chapter 7 is used to treat the core- valence interactions. The computation details regarding 
the trial wave function, including the Ge and H basis sets, are identical to the description 
in Section 9.210.5. 

10.2 The Structure of the Nanoclusters 

Nanostructured Ge and Si systems have been fabricated by a variety of techniques. Very 

small, nearly spherical Ge nanocrystals have been synthesized by rf cosputtering of Ge and 

Si02 [22], ion implantations [226], as colloidal nanocrystals [53] and reduction of metastable 

compounds [82]. It is typically assumed that these nanocrystals or nanoclusters retain a 

tetrahedral coordination in the center and that the surface dangling bonds are saturated, 

usually by hydrogen: 1 these nanoclusters are known as the hydrogen terminated Ge or Si 

nanoclusters (Ge n H m or Si n H m ). Unfortunately the exact ground state structures of all but 

the smallest Ge and Si nanoclusters are unknown. At the present time the biggest obstacle 

to commercial adaption of Ge or Si nanoclusters is the reliability of the fabrication process; 

it is difficult to prepare nanoclusters in a size and shape consistent manner, and without the 
1 Hydrogen is present in many of the preparation procedures [53, 72] 
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presence of contaminants. 

The ground-state structures of all but the smallest Ge nanoclusters are unknown. Almost 
all the theoretical studies of Ge n H m and Si n H m have used LDA relaxed geometries derived 
from the bulk diamond structures of Ge and Si, as shown in Figure 10.1(f) for the Ge2gH 36 
nanocluster. The motivation for this approach is twofold: (i) for large nanoclusters it is 
reasonable to expect that the core maintains bulk-like properties: (ii) bulk derived structures 
are ideal for studies that attempt to approach the bulk limit. Hydrogen is placed along the 
appropriate tetrahedral direction to saturate all the dangling bonds. The initial geometries 
of our clusters are determined by experimental values for the Ge-Ge bond length (4.65 A) 
from the bulk and the Ge-H bond length (1.520 A) from the GeH 4 molecule. Then, the 
structures are fully relaxed in DFT-LDA using an identical procedure to TM [1]. The LDA 
relaxation reduced the Ge-Ge bond by less than 1% and increased the Ge-H bond by 3-4%. 
We have not attempted to relax the geometries of the nanoclusters in their optically excited 
states to take the Stokes shift into account for the optical excitations. 

The fashion in which the nanoclusters are synthesized has been found to significantly 
effect the final structure that forms, and hence their optical properties. Draeger et al. [169] 
employed first-principles molecular dynamics (FPMD) and QMC to determine impact of 
the experimental synthesis process on structural and optical properties. This study was 
motivated by experiments that found optical gaps of Si clusters formed by physical vapor 
deposition several eV lower [227] than those formed using inverse micelles [228]. Draeger et al. 
found that the number of atoms in the core is larger than ideal bulk-derived structures when 
a finite temperature synthesis process is emulated. It was observed during the formation 
process of 1 nm clusters (approximately 30 Si atoms) that the relaxation of the high strain 
induced by the curvature and dangling bond states at the surface is in direct competition with 
the preference of the interior atoms to be tetrahedrally coordinated. It was also observed 
that the type of core structure that forms depends on the mechanism by which the surface 
is passivated during the synthesis method. 
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(e) Ge 14 H 20 (f) Ge 29 H 36 

Figure 10.1: The Ge n H m nanoclusters. 
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10.3 Previous Experimental and Theoretical Work 



10.3.1 Review of Experiments 

There have been many experimental reports of strong visible photoluminescence (PL) in Ge 
nanocrystals in an SiC>2 matrix prepared by a variety of methods [75, 79, 80, 81, 82, 83, 84, 
85, 86, 87, 88, 89]. In all of these reports the strong PL maximum was observed at about 
2.2 eV, independent of size of the nanocrystals (in the 2-15 nm size range), and cannot 
be explained by a quantum confinement model. It has been suggested that defects in the 
SiC>2 matrix region or the interface between the SiC>2 matrix and the Ge nanocrystals are 
responsible for the size- independent visible PL [22]. 

Takeoka et al. [22] observed size-dependent PL for Ge nanocrystals in the 0.9 - 5.3 nm 
diameter size range in the near infrared region, which is closer to the bulk band gap of Ge 
(0.66 eV at 300 K), see Figure 10.2. The Ge nanocrystals were fabricated by rf cosputtering 
of Ge and Si02 and post annealing. For the sample with an average diameter d ave of 5.3 nm, 
the PL peak was observed at about 0.88 eV. Decreasing the diameter of the nanocrystals 
had the effect of shifting the PL peak to higher energies, with the maximum of 1.54 eV for 
the sample with an d ave = 0.9 nm. The PL intensity was also observed to increase with 
decreasing diameter. 2 

Takeoka et al. maintained that the strong PL detected in their experiments originated 
from a recombination of electron-hole pairs confined to the nanocrystal Ge [22]. However, 
Niquet et al. [58] arrived at a different interpretation, due to an incompatibility between their 
own semi-empirical (sp 3 tight binding) calculations and the experimental results. Niquet et 
al. [58] maintained that the size dependent PL in the near infrared region is probably 
related to the recombination of electron (hole) trapped on a surface defect with free hole 
(electron) confined in the nanocrystal. In addition, Niquet et al. [58] suggested that the size 

2 It is important to emphasize that size-dependent PL have been observed in Si nanoclusters [68, 229, 
230, 231, 73, 232, 233, 74]. 
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independent blue-green PL observed previously by so many researches, see Refs. [75, 79, 80, 
81, 82, 83, 84, 85, 86, 87, 88, 89], comes from defects in the oxide (Si0 2 defects). 

Taraschi et al. [54] optically characterized nano crystalline Ge samples with infrared 
photoluminescence measurements at 4 K, using a 488 nm Ar laser excitation. The nanocrys- 
talline Ge samples with small nanocrystals (<15 nm) exhibited a broad infrared PL back- 
ground ranging from about 1.0 — 1.7 fim. The broad background is attributed to quantum 
confinement in the Ge nanocrystals, consistent with a wide distribution of nanocrystal sizes 
and hence a broad range of confinement energies. The exact mechanism for the elecron-hole 
recombination is not known: the possibilities include the recombination of both confined 
holes and electrons in nanocrystals or the recombination of an electron (hole) trapped on a 
nanocrystal surface defect, with a free hole (electron) confined in the nanocrystal. 

More recently, Konchenko et al. [55] prepared spherical Ge dots of 3-10 nm in diameter on 
a 0.3 nm thick Si02 film of Si(lll) substrate, similar to the procedure reported by Ichikawa 
et al. [234]. These nanocrystals can be reasonably regarded as a model of Ge nanoclusters 
embedded in an Si02 matrix. Quantum confinement of valence band of Ge nc observed by 
photoemission spectroscopy. 

10.3.2 Theoretical Overview 

Most of the theoretical work on Ge and Si nanoclusters has been dedicated to predicting 
the dependence of the optical gap with respect to the cluster size [23, 58, 25, 63, 64, 65, 
24, 235, 26, 2, 56, 61, 1, 28]. For larger clusters (> 500 atoms), the optical gap can be 
approximated by calculating the quasiparticle gap and using semi- empirical approaches to 
correct the relatively small (few meV) exciton binding energy. However, in 0-2 nm size range 
these approaches break down due to poor descriptions of the exciton binding energy and an 
inadequate description of the nanocluster surface [2] . Our interest is is mainly confined to 
nanoclusters in this size range. 

Semi-empirical treatments are based on knowledge of the electronic structure of the bulk 
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Figure 10.2: Experimental results for the dependence of photoluminescence spectra with 
respect to the average diameter of nano-crystaline Ge from Takeoka et al. [22]. The photo- 
luminescence spectrum of the SiC>2 film is also displayed. 

material (measured or computed) and the transferability to the nanoscale size range has to 
be questioned [58]. This holds for tight-binding and the empirical pseudopotential approach 
[63] . In addition, the tight-binding parametrization of bulk Ge is not unique and the extent 
to which this parametrization remains valid for small nanocrystallites is unknown [28]. The 
effective mass approximation is known to overestimate optical gaps for crystallites in the 
nanometer size region [236, 237]. Therefore, our interest is mainly confined to ab-initio 
work. 



10.3.3 Focus on Ge Systems 

There have been several studies of Ge nanoclusters by a variety of methods, including the 
effective mass approximation [238], tight binding [239, 58, 240], empirical pseudopotential 
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[59, 60] and ab-initio work such as DFT [58, 61, 62, 63, 64, 65], including TDDFT [1, 
119, 28]. Some of the more notable work includes an sp 3 tight-binding study by Niquet 
et al. [58], who derived analytic laws for confinement energies over a whole range of sizes. 
Their work for the smaller nanoclusters was corroborated by ab initio LDA calculations. 
Melnikov and Chelikowsky [61] calculated the absorption spectrum in DFT-LDA using a 
real-space method. One of their most notable findings was that the spectra was essentially 
bulk-like for systems containing more than 250 atoms. In a related study, Melnikov and 
Chelikowsky [62] found that the calculated ionization potentials and affinities of the Ge n H. m 
and the Si n H m nanoclusters remain significantly different from the corresponding bulk values 
up to systems containing around 800 atoms. Tsolakidis and Martin [1] and Nesher et al. 
[28] have calculated the absorption spectrum by TDDFT using the adiabatic local density 
approximation (ALDA). Weissker and co-workers have studied nanocluster Ge using a ASCF 
method 3 within LDA and LSDA [63, 65], where one study included structural relaxation 
of the ground state atomic configurations [64]. At the present time there are no published 
works that treat the optical excitations of the Ge nanoclusters in a many-body sense. The 
present QMC results are among the first truly many-body calculations of the excitations of 
the Ge nanoclusters. 

10.3.4 Focus on Si Systems 

Far more theoretical work has been dedicated to the study of Si nanoclusters. Among the 
many-body calculations for the Si n H, m nanoclusters are the DMC results of Williamson et 
al. [2] in the to 1.5 nm diameter size range, with the Si87-f^76 nanocluster being the largest 
in the study. Their DMC optical gaps were found to be in excellent agreement with earlier 
GW-BSE results by Rohlfing and Louie [25] for the first four fully hydrogenated Si n H m 

3 The ASCF has been briefly discussed in Section 2.3.2. Weissker et al. [63, 65, 64] used the ASCF 
method, which includes the electron-hole interaction energy, to estimate the optical gap by computing the 
total energy difference between the ground state and an excited state created by promoting an electron from 
the HOMO to the LUMO. 
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Figure 10.3: Difference in the optical gaps for the Si n H m nanoclusters predicted by several 
methods with respect to the DMC results of Williamson et al. [2]: ▲ LDA HOMO-LUMO 
single particle gaps, A LDA quasiparticle gap minus the empirical electron-hole exciton 
binding energy and corrected to include the electron-hole polarization enrgy from Ref. [23], 
□ TDLDA from Ref. [24] (effective gap) ■ TDLDA, • GW-BSE from Ref. [25]. 

nanoclusters: SiH 4 , Si2H 6 , SisHi 2 and SiioHi 6 (Figure 10.3). However, it should be stated 
that the accuracy of these GW-BSE results[25] have since come into question because the 
authors neglected the off-diagonal matrix elements of the self-energy operator [57]. The 
inclusion of these terms shifted the lowest singlet of SiH4 from 8.8 to 9.2 eV, which is closer 
to the DMC result of 9.1(1) eV. The full self-energy operator allows the unoccupied states 
to become more delocalized, which significantly reduces the electron-hole interaction [57]. 

A major focus of the work by Williamson et al. [2] was the comparison between the 
optical gaps predicted by DMC and DFT, including TDDFT. The DMC results for the op- 
tical gaps were found to be larger than the corresponding DFT-LDA single-particle gaps 
for all nanoclusters sizes. In addition, the DMC optical gaps are larger than the TDLDA 
gaps: this includes their own calculations and those of Vasiliev et al. [24]. The compari- 
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son between DMC and TDLDA is discussed in more detail in Section 10.6.1. Williamson 
et al. [2] also performed DFT calculations with the B3LYP functional [241]. The B3LYP 
results were found to be in good agreement with DMC, but for almost all nanocluster sizes 
the time-dependent B3LYP (TD-B3LYP) results were in worst agreement than the single- 
particle gaps: this is also true for the TD-B3LYP results of Garoufalis et al. [235]. In 
addition, Garoufalis et al. performed many-body multireference second-order perturbation 
theory (MR-MP2) calculations for Si nanoclusters as large as Si2gH 36 . The MP2 gaps were 
found to nearly coincide with the TD-B3LYP gaps. Porter et al. [26] also performed DMC 
calculations on the Si n H m , n + m < 34, nanoclusters for the first dipole allowed and dipole 
forbidden singlet transitions and the first triplet transitions. The DMC results were com- 
pared to corresponding singles-only configuration interaction (CIS) and TDLDA results, see 
Figure 10.4, with the DMC optical gaps approximately 1 eV larger than the TDLDA gaps. 
This difference between the DMC and TDLDA gaps is similar to the difference independently 
reported by Williamson et al. [2]. 

Benedict et al. [56] compared the absorption spectra for several of the smaller Si n H m nan- 
oclusters calculated by GW-BSE and TDLDA. In the GW-BSE calculations the screening 
is determined by the Hybertsen-Levine-Louie model [242, 132] and the optical gap is com- 
puted with DMC. The gross features of the two methods are similar for the larger Si n H m 
nanoclusters, due to the dominance of the Coulomb part of the interaction kernel which 
is the same in both treatments, but there are significant differences near the absorption 
edge. For the SiH4 molecule, the two methods produce markedly different results, with the 
TDLDA spectra appearing closer to the experimental result. 

10.3.5 The Ge - Si Crossover Question 

A great deal of theoretical work has been dedicated to the Ge and Si crossover question: 
whether or not there is a crossover in the optical gaps of the Ge and Si nanocrystals at a 
critical radius (diameter). This is of great interest because it is known that in the bulk limit 
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Figure 10.4: The lowest-bright (dipole-allowed) singlet excitation energies for several Si n H m 
nanoclusters by Porter et al. [26]. The experimental results for SiH 4 and Si2H 6 are from 
Itoh et al. [19] and the results for SisH^ are from Feher [27]. The first two peaks in the 
absorption spectrum are included for SiH 4 . 

the band gap of Ge is approximately 0.68 eV, which is in the near infrared region (NIR), is 
lower than the band gap of bulk Si, which is approximately 1.1 eV. 

Takagahara and Takeda [238], using an effective mass approximation (EMA), predicted 
the gaps of the Ge passivated nanoclusters would be larger than the Si nanoclusters for a 
critical radius smaller than 30 A. This is attributed to differences between the electron and 
hole effective masses of Ge and Si, with the charge carriers of Ge having a smaller effective 
mass. Hill et al. [239], using tight binding (TB), came to a similar conclusion as Takagahara 
and Takeda [238], but with a critical radius less than 12 A. Reboredo and Zunger [59], using 
an empirical pseudopotential approach, found no clear crossover for a radius as small as 
10 A. The empirical pseudopotential calculations indicated that the lowest conduction state 
changes from L derived to X derived below a critical diameter; hence the Ge nanocrystals 
become Si-like. Reboredo and Zunger [59] maintained that this phenomenon explained the 
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absence of a crossover. Weissker et al. in Refs. [63, 64, 65], using the ASCF method for DFT 
within the LDA and LSDA approximations, did not observe a crossover up to a radius of 
12 A. A recent paper by Nesher et al. [28] did not observe a crossover when comparing their 
TDLDA results for the absorption spectra for Ge n H m , (n + m < 247), to the corresponding 
TDLDA spectra for Si n H m by Vasiliev et al. [24]. The TDLDA optical gaps for Si n H m were 
larger than the Ge n H m optical gaps for all nanocluster sizes, including the optical gap of 
the "zero diameter" SiH 4 molecule (8.8 eV for SiH 4 compared to 8.3 eV for GeH 4 ). Both 
of these TDLDA studies calculated the absorption spectra within linear response theory, 
see Section 2.7.2 for a discussion. Real-time TDLDA calculations, see Section 2.7.3, by 
Tsolakidis and Martin [1, 119] also did not observe a crossover in the 1-2 nm diameter 
size range. In addition, ab-initio real-space pseudopotential calculations by Melnikov and 
Chelikowsky [62] found that while the ionization energies and electron affinities of the Si and 
Ge structures are nearly identical, the quasiparticle and single particle gaps of Si are larger 
by approximately 0.2 eV than those of Ge. 

Closely related to the crossover question is the issue of the oscillator strengths: the 
relative size of the oscillator strengths for the Si and Ge clusters near the absorption edge. 
Tsolakidis and Martin [1] observed that the Ge nanoclusters exhibit more oscillator strength 
when compared to the Si nanoclusters in the low energy region near the absorption edge. 
This was attributed to the presence of more transitions in the low energy spectrum of bulk 
Ge than bulk Si [243]. Many of these transitions are indirect and therefore forbidden in 
the bulk, but because of the relaxation of k selection rules due to quantum confinement 
are allowed in the nanoclusters. For both the Si and Ge nanoclusters it was found that 
the oscillator strengths of the indirect transitions are significantly enhanced. The results 
of Tsolakidis and Martin corroborate the ab-initio ASCF work within LDA by Weissker et 
al. [63], who observed that the Ge and Si nanoclusters behaved completely differently near 
the absorption edge (HOMO-LUMO gap). Weissker et al. [63] found that with increasing 
nanocluster size that the Si nanoclusters show a tail of weak transitions with decreasing 
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oscillator strengths, while optical transitions with large oscillator strengths occur for Ge 
nanoclusters near the HOMO-LUMO gap. 

10.4 The Importance of Surface Chemistry 

As we have already mentioned, quantum confined core states just represent one mechanism 
responsible for observed optical gap in the Ge and Si nanoclusters. The surface chemistry 
of the nanoclusters must be taken into account in order to properly explain their optical 
properties. 4 Reducing the size of the clusters to the molecular limit results in an increase 
of the surface area to volume ratio; hence, the surface chemistry is expected to strongly 
influence the optical properties [66]. 

The most important mechanisms by which the surface chemistry affects the optical prop- 
erties are by surface terminations, passivation and reconstruction. Termination refers to how 
the surface is terminated, specifically whether the surface is fully passivated or not. Pas- 
sivation refers to the choice of passivates, such as single bonded passivates like hydrogen, 
or double bonded passivates like oxygen. Surface reconstruction refers to the structural 
relaxation of the nanocluster that occurs when surface passivants are removed. 

TDLDA calculations by Vasiliev et al. [69] found that including a single oxygen atom 
on the surfaces of the Si3 5 H 3 4 and Si 2 9H 34 nanoclusters (to create Si35H 34 and Si 2 9H 34 0) 
effectively reduced the optical gap by 1-2 eV. It is believed that oxygen introduces extra 
localized states near the gap [69]. This work was motivated by the experimental results of 
Wolkin et al. [68], who observed a large redshift of the photoluminescence in porous silicon 
after exposure to air. More specifically, a redshift of the order of 1 eV was reported for 
the samples composed of crystallites smaller than 2 nm which was attributed to surface 
oxidation of silicon nanocrystals. 

DFT and DMC calculations by Puzder et al. [66] showed that significant changes occur 

The work cited in this section deals with Si nanoclusters; however, it is reasonable to expect that any 
conclusions can be extended to Ge nanoclusters 
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in the gap of fully hydrogenated nanoclusters when the surface contains passivants other 
than hydrogen. Substituting hydrogen for another single bonded passivant was found to 
have an insignificant affect on the optical properties of the Sissf^ nanocluster. In contrast, 
double bonded passivants (Si35H 34 Y) reduced the gap by at least 1 eV, this reduction was 
accompanied by a local distortion of the sp 3 network on the surface. It was proposed that 
the origin of the wide discrepancy between previous theoretical results [24, 236, 244] for 
hydrogen passivated Si nanoclusters and experimental results in Refs. [227, 232, 245] could 
be due to oxygen contamination. 

Mitas et al. [67] measured the effect of surface reconstruction for the Si2gH 36 nanocluster: 
the absorption spectrum was determined by CIS and subsequently shifted such that the CIS 
absorption edge (lowest excitation energy) was in agreement with the optical gap determined 
by DMC. Mitas et al. [67] found that removing 12 hydrogen atoms from the fully passivated 
Si2gH36 nanocluster and allowing for surface reconstruction lowered the absorption edge from 
approximately 4.8 eV to approximately 3.5 eV. Removing an additional 12 hydrogen atoms 
(to form Si2gH 12 ) and allowing for reconstruction dramatically lowered the absorption edge 
to approximately 0.1 eV. In addition, a Density Functional study Lehtonen and Sundholm 
[70] found that the absorption threshold of the fully passivated Si29H 36 nanocluster lies in 
the ultraviolet (320-340 nm) 3.87-3.65 eV region, whereas the absorption threshold of the 
Si29H24 nanocluster is about 450 nm or 2.76 eV. 

10.5 Computational Method 

The computation details regarding the QMC trial wave function of the Ge n H m nanoclusters 
are identical to the description in Section 9.2, for the GeH 4 and Ge2H 6 molecules. All DMC 
calculations use a time-step of either 0.05 a.u. or 0.025 a.u. 

The optical excitations are calculated in QMC following the method in Section 4.13. 
Only the trial wave function for the ground-state is optimized; optimization of the ex- 
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(a) HOMO (b) 2nd LUMO 

Figure 10.5: GesHi 2 : Isosurfaces for the molecular orbitals used to compute the optical gap 
(red +, blue -). 

cited state wave function was found to have an insignificant effect on the optical gap. Fig- 
ures 10.5, 10.6, 10.7 and 10.8 plot the isosurfaces of the HOMO and excited state MOs used 
in computing the optical gaps of Ge 5 H 12 , Gei H 16 , Gei 4 H 20 and Ge 2 9H 36 . 5 The isosurfaces 
of the MOs for GeH4 and Ge 2 Hg can be found in Figures 9.3 and 9.4. Note that the second 
LUMO is used to compute the optical gap of Ge 5 H 12 , since the HOMO — > LUMO transition 
is optically forbidden. 

For the larger nanoclusters, it was necessary to constrain the weights in the correlated 
sampling to unity for stability reasons in the optimization [192, 51]. For a discussion of 
constrained weight optimization see Section 6.8.4 . 

10.5.1 The LDA Relaxation of the Structures 

The LDA relaxation of the structures closely follows the method of Tsolakidis and Martin 
in Refs. [1, 119]. The structural relaxations include all of the constituent atoms; all calcu- 
5 All isosurfaces made using the XCrySDen package [220]. 
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(a) HOMO (b) LUMO 

Figure 10.6: GeioHi 6 : Isosurfaces for the molecular orbitals used to compute the optical 
gap. 




(a) HOMO (b) LUMO 

Figure 10.7: Gei 4 H 20 : Isosurfaces for the molecular orbitals used to compute the optical 
gap. 
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(a) HOMO (b) LUMO 

Figure 10.8: Ge2gH36: Isosurfaces for the molecular orbitals used to compute the optical 
gap. 

lations were performed using the SIESTA DFT package [181, 246, 247, 248]. The SIESTA 
calculations featured an auxiliary real space cutoff that is equivalent to a plane-wave cutoff 
of 70 Ry. 

All of the SIESTA calculations used a relativistic LDA pseudopotential, generated by the 
Troullier-Martins [206] method. The choice of pseudopotential is one of the most important 
differences between the results presented in this thesis and those of Tsolakidis and Martin 
and the majority of the previous work on Ge clusters. In addition, in SIESTA it is necessary 
to use a pseudopotential for the H nulcei: this is because SIESTA cannot deal with the cusp 
at the location of the H nuclei. 

The basis for each Ge atom consists of 13 numerical atomic orbitals (NAO's) per atom: 
two radial shapes to represent the two 4s states with confinement radii r s = 5.08 a.u., two 
radial shapes to represent the six 4p states with confinement radii r p = 8.04 a.u. and a 
polarization d shell with confinement radius r^° l = 8.04 a.u. The H basis set includes five 
NAO's per atom: two radial shapes for Is states with confinement radii r s = 6.05 a.u. and 
a p polarization shell with a confinement radius of r p = 6.05 a.u. In SIESTA this is the 
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default DZP basis for both Ge and H. The basis for Si is very similar to the Ge basis, with 
the confinement radii r s = 6.12 a.u., r p = 7.85 a.u., and r^° l = 7.85 a.u. for the polarization 
d states. 

10.6 Results for the Optical Gaps 

10.6.1 Comparison of DMC to TDLDA for Ge Clusters 

The TDLDA spectrum has generally been observed to evolve from a discrete spectrum 
with large oscillator strengths near the absorption edge for the smallest Ge n H m and Si n H. m 
nanoclusters to a quasicontinuous spectrum with small oscillator strengths for the larger 
nanoclusters [56, 24, 28]. As we will see this will have a consequence when determining the 
optical gaps of the larger nanoclusters. All of the nanoclusters under consideration are well 
below the bulk limit where the TDLDA results are expected to converge to the LDA results 
[118]. 

It should also be stated that is has been observed by several researches that TDLDA 
does not dramatically correct the value of the optical gap of the Si and Ge nanoclusters 
over LDA; however, TDLDA gives better results for the absorption spectrum over the entire 
energy range [2, 56, 1, 119, 249]. This is due to the inclusion of dynamical correlations within 
TDLDA which corrects the oscillator strengths of the transitions and moves spectrum to 
higher energies (blueshifts) compared to the LDA spectrum. Tsolakidis and Martin observed 
that TDLDA corrected the gap by only about 0.2 eV relative to LDA [1]. 

From Figure 10.9 it can be seen that the DMC+CPP gaps for the Ge n H m nanoclusters 
are larger than the corresponding TDLDA gaps reported by Nesher et al. [28], which is in 
agreement with the previous DMC and TDLDA comparisons made for the Si n H m nanoclus- 
ters by Williamson et al. [2] (see Figure 10.3) and Porter et al. [26] (see Figure 10.4). Note 
that the TDLDA gaps shown in the figure are "effective" optical gaps, which are described 
below in the discussion about silicon clusters. Tsolakidis and Martin [1] have also performed 
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Figure 10.9: Comparison of the present DMC+CPP results for the lowest gaps of the 
Ge n H m , (n + m) < 65, nanoclusters (•) to the TDLDA results of Nesher et al. [28] (♦). 
The TDLDA results are "effective gaps" as described in the text. 

TDLDA calculations for the Ge n H m nanoclusters, with Ge2gH 36 as their smallest structure. 
Their value for the lowest gap of 3.66 eV for Ge 2 9H 36 [119] is approximately 0.5 eV lower 
than the effective optical gap reported by Nesher et al. [28], which is also consistent with 
the comparison of the lowest and effective gaps calculated using TDLDA for silicon clusters 
[2]. See Figure 10.10 for the TDLDA absorption spectra from both Nesher et al. [28] and 
Tsolakidis and Martin [1]. 

The size of the difference between the optical gaps from DMC and TDDFT strongly 
depends on how the optical gap is assigned in TDDFT. Williamson et al. [2] choose to 
compare their DMC results for the Si n H m , (n + m) < 163, nanoclusters to the first non-zero 
transition in the TDDFT spectrum, maintaining that this is the closest comparison between 
their DMC results and TDLDA. Their DMC results for the optical gaps were found to be 
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Figure 10.10: Left: TDLDA results for the absorption spectra of the Ge n H m nanoclusters 
by Nesher et al. [28]. Right: TDLDA results for the absorption spectrum of Ge2gH 36 by 
Tsolakidis and Martin [1], plotted on two energy scales. 



larger by 1-2 eV than the TDLDA gaps for all nanocluster sizes; the size of the difference was 
smaller when comparing to the "effective" optical gaps from the TDLDA results of Vasiliev 
et al. [24]. Vasiliev et al. [24] and Nesher et al. [28] defined the TDLDA optical gap as 
the point at which the integrated oscillator strength exceeds a threshold of 2% of the total 
absorption spectrum. The argument behind this assignment is that the oscillator strengths of 
the transitions below the integration threshold would be below the experimentally detectable 
limit [24, 28] . For the smallest nanoclusters these different methods of assigning the optical 
gap are in agreement; the oscillator strengths of the first excitations exceeds the threshold. 
A difference between the two approaches begins to appear for the larger nanoclusters; the 
integration threshold is exceeded only after integrating over many smaller peaks in the 
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absorption spectrum. 



10.6.2 Comparison to Previous DMC Results for Si Clusters 

From Figure 10.11 it is immediately noticeable that a difference exists between the two sets 
of previous DMC calculations for the optical gaps of the Si n H m nanoclusters. This is not 
a disagreement, but rather the fact that Porter et al. [26] have given the gap for a higher 
transition whereas the other QMC results are for the lowest gap. For the SiioHi6 nanocluster 
Porter et al. [26] choose to assign the third dipole allowed transition from CIS to calculate 
the optical gap in DMC. The justification for this choice was that the oscillator strengths of 
the first two transitions were considerably smaller; at the CIS level the difference between 
the first transition and the third transition was about 0.7 eV. 




1 2 5 10 14 29 

Number of Ge (Si) atoms 



Figure 10.11: Comparison of our DMC results for Ge n H m (•) to the DMC results for the 
corresponding Si nanoclusters by Porter et al. [26] (A) and Williamson et al. [2] (V). We 
have also included our own DMC results (o) for Si5H 12 and SiioH 16 . 
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Figure 10.11 also shows results of our own DMC calculations for the optical gaps of SisH^ 
and SiioH 16 in order to have a direct comparison of the Si and Ge clusters, and to compare 
with the previous work on Si clusters. We have not included the CPP for Si, which is a 
much smaller effect in Si than in Ge and cannot be approximated by a local potential [43]. 
Otherwise, all of the computational details, including the use of a HF pseudopotential [12] 
and a HF derived trial wave function, are identical for the Si n H m and Ge n H m nanoclusters. 
Our DMC gap for SisH 12 lies in between the DMC results of Williamson et al. [2] and Porter 
et al. [26] , while our DMC gap for SiioHi 6 is nearly identical to the result of Williamson et 
al. [2]. 

It is gratifying that the lowest gaps calculated by QMC are in good agreement, even 
though there are differences in the calculations. Williamson et al. [2] used LDA pseudopo- 
tentials to replace the Si and H cores and DFT-LDA single particle orbitals to construct 
their Slater determinants. The single particle orbitals were first transformed by a unitary 
transformation to form a set of truncated, maximally localized Wannier (MLW) functions 
[250, 251], which have been shown to increase the efficiency of QMC. 6 In contrast, our work 
and the calculations of Porter et al. [26] used a relativistic HF pseudopotential for Si. To 
obtain the Slater determinant part of the trial wave function, Porter et al. [26] used the 
results of a CIS calculation. Porter et al. [48, 26] argued that the CIS derived trial wave 
function was considerably better than the HF derived wave functions for the excited states 
based on tests for the SiH 4 molecule. It is important to note that CIS, which involves only 
single-substitutions, differs from HF for the excited states; the ground state for CIS is the 
HF ground state. 

Table 10.1 contains all of our QMC results for the optical gaps of the Ge n H m , (n + m) < 

65. This includes the VMC+CPP and DMC+CPP gaps along with the CPP corrections as 

calculated by Eq. (9.2). This table serves as a reference for future comparisons. 

We have not been able to observe any clear size dependence for the CPP correction to the 
6 Recent articles have claimed near linear scaling in QMC by use of MLW functions [252, 253]. 
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Table 10.1: QMC optical gaps of the Ge n H m , (n + m) < 65, nanoclusters with the CPP 
correction ACPP. All energies are in eV. 



Cluster 


VMC+CPP 


ACPP 


DMC+CPP 


ACPP 


GeH 4 


9.7651(74) 


-0.2520(14) 


8.934(13) 


-0.2272(21) 


Ge 2 H 6 


7.707(11) 


-0.2357(25) 


7.336(9) 


-0.2235(16) 


Ge 5 H 12 


6.845(18) 


-0.318(3) 


6.562(31) 


-0.3045(46) 


GeioHi6 


6.436(27) 


-0.074(3) 


6.36(18) 


-0.0678(89) 


Gei 4 H 2 o 


5.495(24) 


-0.038(3) 


5.62(12) 


-0.033(14) 


Ge 2 9H 36 


5.480(73) 


-0.133(16) 


5.09(21) 


-0.11(2) 



optical gaps. We believe that the CPP correction strongly depends on the properties of the 
excited state used to calculate the optical gap in DMC. For the larger Ge n H m nanoclusters, 
the size of the CPP corrections are roughly the same magnitude as the error bars of the 
optical gaps. This is not surprising since in the bulk limit the CPP correction for the 
indirect band gap of Ge is estimated to be approximately 0.2 eV. This estimate is based on 
quasiparticle band energies provided by Eric Shirley [20]. For the largest nanocluster in our 
study, Ge 2 gH36, the CPP correction is approximately 0.1 eV while the error bar for the optical 
gap is approximately 0.2 eV. In addition, it should be stated that it becomes impractical 
to reduce the error bars of the energy differences below 0.2 eV for the larger nanoclusters 
due to the lack of experimental results for comparison. Furthermore, systematic errors in 
DMC due to the fixed-node error the evaluation of the non-local pseudopotentials within 
the locality approximation limit the accuracy of the optical gaps and obviate reducing the 
error bars to the level of chemical accuracy (approximately 0.04 eV). 
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Chapter 11 

Conclusions, General Observations 
and Future Directions 

This work presented in this thesis is related to four topics: the treatment of electronic 
correlations among valence electrons, inclusion of the effects of the core on the valence 
states, applications of Quantum Monte Carlo (QMC) and optical properties of semiconduc- 
tor nanoclusters. The highly accurate QMC method can treat electronic correlations at a 
many-body level; however, the large energies of core states leads to large fluctuations and it 
almost always necessary to remove the core-electrons of heavy elements and treat the valence 
electrons using a pseudopotential. Therefore the core-valence interactions are treated at a 
different level of accuracy than the valence-valence interactions. The post transition element 
Ge (Z=32) is of particular interest since the usual method of treating the core- valence inter- 
actions by a pseudopotential from an independent particle approximation is unsatisfactory. 
An improved method is to augment a Hartree-Fock (HF) pseudopotential with a core polar- 
ization potential (CPP) to take into account core-valence correlations at a many-body level, 
which are large in Ge because of its shallow, polarizable core. 

An example of a class of systems where electronic correlations are known to be important 
are semiconductor nanoclusters. The optical properties of nanoclusters constructed from Ge 
or Si have recently been the focus of a growing body of experimental and theoretical work. 
These nanoclusters have a widened band gap relative to the bulk due to quantum confinement 
that is inversely proportional to the size (diameter) of the nanoclusters. Therefore, Ge and 
Si nanoclusters offer the possibility of tunable photoluminescence which has a tremendous 
number of potential applications. This thesis is one of the earliest applications of QMC to 
the optical excitations of Ge nanoclusters. 
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QMC calculations were presented for the ground state and excited states of the Ge atom, 
molecules and hydrogen terminated Ge nanoclusters, with Ge 2 9H 36 as the largest system in 
the study. As mentioned previously, the interacting system of valence electrons was explicitly 
treated at the many-body QMC level using a Hartree-Fock pseudopotential to which is added 
a CPP to account for the core-valence correlations. In addition, the effect of core-valence 
correlation can be quantified by the estimator of the CPP. The CPP correction to excitations 
can be evaluated as the total energy difference of the ground state and excited state CPP 
estimators. 

The effect of the CPP was found to be significant for the total energy and excitations of 
the atom and small molecules. Analysis of these cases shows that size of the CPP correction 
for the excitations largely depended on the nature of the excited states; mainly the relative 
Ge s and p character of the excited state. The CPP correction is largest for the excited 
states with a dominant Ge s character. For the larger Ge n H m nanoclusters it was found that 
the size of the CPP corrections were much smaller than in the Ge atom and the molecules 
GeH 4 and Ge2H 6 , with no clear size dependence. This can be understood due to the strong 
mixing of s and p Ge atomic orbitals for the excited states. 

The QMC optical gaps for the Ge n H m nanoclusters are compared to the optical gaps 
determined from time-dependent density functional theory (TDDFT) in the adiabatic local 
density approximation (TDLDA) [28, 1] and to QMC results for the corresponding Si nan- 
oclusters [2, 26]. The calculations were carried out using structures optimized with DFT 
calculations in the Local Density Approximation (LDA) and using a Gaussian basis set for 
the trial wave functions in QMC. Optical excitations were treated by promoting one electron 
from the highest occupied molecular orbital (HOMO) to the lowest unoccupied molecular 
orbital (LUMO) orbitals, as has been done in previous work [57, 2]. The QMC results for 
the gaps were found to be larger than the TDLDA gaps by roughly one electron volt, which 
is in agreement with previous comparisons between QMC and TDLDA for Si n H m in the - 
2 nm diameter size range [26, 2]. In order to make direct comparisons of Ge and Si clusters 
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it was necessary to perform calculations of Si (S15H12 and SiioHi 6 ) using the same methods, 
with results that agree well with previous calculations. The primary conclusion is that the 
gaps for Ge clusters are very close to those of Si clusters, as shown in Figure 10.11. It is 
worthwhile to note that previous TDLDA [1] and delta self-consistent field theory within 
DFT-LDA [63] calculations found that oscillator strengths are significantly higher for Ge 
than for Si clusters. Thus the work in this thesis supports the conclusion that the gaps of Si 
and Ge clusters are very close, but nevertheless Ge clusters have increased optical absorption 
at low energies due to increased oscillator strengths. 

A major part of the work for this thesis was dedicated to the design and development of a 
new Quantum Monte Carlo program named qmcPACK [3]: this includes writing the actual 
code and extensive testing. This code can perform Variational Monte Carlo and Diffusion 
Monte Carlo in the Fixed-Node approximation; the completion of these two components was 
a prerequisite for the results in the preceding chapters. A factor in the design of qmcPACK 
was versatility in the choice of the trial wave function. qmcPACK can import trial wave 
functions from a variety of sources, such as the SIESTA DFT package, Gaussian 98/03, 
Ab-Init and orbitals on radial or three-dimensional grids. In addition, qmcPACK can now 
handle periodic boundary conditions and can do twist- averaging boundary conditions [254]. 
The long-ranged electron-electron and elecron-ion interactions can either be handled by the 
traditional Ewald break-up method 1 or by the method developed by Natoli and Ceperley 
[255] and modified by Esler [256]. 

In the future it is desirable to use better trial wave functions, especially for excited states. 

The most common approach to construct an excited state in QMC is to promote an electron 

from an occupied to an unoccupied excited state orbital in the Slater determinant part: the 

single-particle orbitals usually come from a preceding Hartree-Fock or DFT calculations. It 

is well known that the excited states from both HF and DFT in the standard approximations 

such as LDA suffer from deficiencies. One possible improvement is to use multiple deter- 
1 See Appendix F of Rcf. [29] for a discussion. 
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minant wave functions from Configuration Interaction or Coupled Cluster approximations, 
such approaches have been used in Refs. [48, 26, 67]. Another possibility is the correlated 
geminal approach discussed in Ref. [190]. A minor modification is to use the Pade-Jastrow 
(see Section 6.6.1) for the hydrogen-electron correlations; the present work uses a cuspless 
function (see Section 6.6.2). The purpose of using the Pade-Jastrow is to better satisfy the 
cusp condition that occurs at the location the hydrogen nuclei, which is only approximately 
satisfied by the Slater type orbitals for the hydrogen Is and 2s orbitals. The Jastrow pa- 
rameter a can be optimized as opposed to being explicitly set to satisfy the cusp condition: 
a — Z, where Z is the charge of ion core. 

Possible future projects include simulating larger Ge n H m nanoclusters and performing 
calculations on bulk Ge. The results for the gaps of the nanoclusters can be compared to 
TDLDA results in Refs. [1, 28] and QMC results for Si nanoclusters in Ref. [2]. This would 
be the most obvious extension of the current work. For the solid, QMC can be used to 
calculate the band structure of Ge; of particular interest is the effect of the CPP on the 
band gaps, especially the direct and indirect band gaps of Ge. Previous QMC studies of 
Ge have reported band gaps at a few special /c-points, but the details of these calculations 
regarding the choice of pseudopotential and trial wave function are much different from the 
approach outlined in this thesis [49, 50]. These calculations were an early application of 
QMC to periodic systems and were therefore intended to establish the viability of QMC for 
solids, rather than highly accurate results. 
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Appendix A 
Appendix 



A.l The Spherical Harmonics 

The spherical harmonics are given by the expression 



where P™ is an associated Legendre function defined by 

( 1 \m Jl+m 

P™( x ) = i— Hi - x 2 ) m/2 ——(x 2 - l) 1 . (A.2) 

In order for P™ to be finite on the interval —1 < x < 1 the parameter / must be a zero or a 
positive integer and the integer m is restricted to the values — /, —(I — 1), . . . , 0, . . . , (I — 1), I. 
Eq. (A.2) is valid for both positive and negative m : it can be shown that 

(I - mV 

p-=(-l)™| r _^j r (x); (A.3) 
hence the spherical harmonics obey the following relation 

y,,_ m (M) = (-i) m [n»(M)r. (A.4) 

The spherical harmonics obey the normalization and orthogonality conditions over the 
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sphere 

f dQ [Y lm (9, <t>)\*Y Vnf {p, 0) = 8 w 8 mm ,, (A.5) 



where 

»2tT 

dVt = I d<t> I dQ sinO 



'o Jo 

is the differential area element on the surface of the sphere. Another important relation 
obeyed by the spherical harmonics is the completeness relation 

oo I 

E [Yi m (e',<f>')}*Yim(e,<f>)=6(<f>-(f>')6(cos6-cos6'), (A.6) 

1=0 m=-l 

which in the more common Dirac notation is 

oo I 

\ Y l™)( Y l™\ = 1- (A.7) 

1=0 m=-l 

Therefore, it is possible to express any function of 9 and 0, (on the surface of a sphere) in 
terms of the complete set of spherical harmonics 

oo / 



fM) = Y< E ^imYUOA)- (A.8) 

1=0 m=-l 

with the coefficients Q m given by 

c lm = j dn f(9, <P) [Y lm (6, 0)]* (A.9) 
Another important identity is the summation rule 

E |>WM)l 2 = ^; (A.io) 

m=— I 

this identity has important implications for closed-shell atomic systems. It should be noted 
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that this section largely follows the formalism of Jackson [111] and that different conventions 
exist for dealing with the phase factor, (— l) m , of the spherical harmonics [180]. Table A.l 
contains the spherical harmonics up to I — 3. 

Table A.l: The spherical harmonics Yi m (8,(p) for the s,p,d and / channels. 



I Lm Yi 



I II I 



0,0 



1,±1 T\ ^-sm6e ±iq 
p V 

1,0 \/%cos9 



2, ±2 y^sin 2 0e ±2 ^ 

A i 

2,±1 TV If sm^cosfle^ 9 

5 (1^2, 



2,0 ,7^(3 cos 2 ^ 



3, ±3 T ^sin 3 #e ±3 ^ 

/ 3, ±2 ySsin 2 flcosfle ±2 ^ 

3,±1 T y / ^sin^(5cos 2 -l)e 

3,0 yQy5cos 3 0-3cos0) 



A. 1.1 The Real Spherical Harmonics 

The spherical harmonics, see Appendix A.l, are complex functions except for the components 
m = 0, i.e. when the z— component of the angular momentum is equal to zero. We would 
like to avoid using complex functions in electronic structure codes and determine a real 
counterpart to the spherical harmonics, i.e. Y\ m — > S\ m . The convention for the real spherical 
harmonics used in the Gaussian03 [11] package: 



Sim — < 



(-l) m ^/2^(Y} mV ) m>0 
Y[° m = 

(-1)"V23(Y; H ) m<0 



(A.11) 



198 



where the factor a/2 is necessary for normalization. Table A. 2 contains the real spherical 
harmonics for the s,p,d and / angular momentum channels. The functions can be trans- 
formed from the Cartesian representation to spherical coordinates by the following change 
of variables 

x = r sin(#) cos(0) (A. 12) 

y = r sin(#) sin(0) (A. 13) 

z = rcos(0). (A.14) 



Table A. 2: The real spherical harmonics Si m (x,y, z) in Cartesian coordinates for the s,p,d 
and / channels. The second column is the index l\ m = 1(1 + 1) + m. 



lm 



lm 



lm ) 



s 



1 

p 1 
1 



2 
2 

d 2 
2 
2 







1 

2 
3 



-2 
-1 

1 

2 



4 
5 
(3 
7 



-3 

-2 

-1 



1 

2 

3 



9 

10 

11 

12 

13 

14 

15 



{0,0,0} 



4tt2/ 



47T 

3 



£{0,1,0} 



{0,0,1} 



\XZ 

^ - v 2 ) 



'g{M,o} 

^{0,1,1} 



-|/,2^} 



35 /o 2 



y 2 )y 



v^(^ 2 - ^ 

^(5z 2 - r 2 )x 
^(x 2 - y 2 )z 

i(x 2 - 3y 2 )x 



^{3x 2 -3y 2 ,-Qxy,0} 
^{yz,xz,xy} 
§^{2xz,-2yz,x 2 -y 2 } 
16 { — 6x2;, —6yz, 6z 2 — 3x 2 



: {Az 2 — 3x 2 — y 2 , —2xy, 8xz} 
^{2xz,-2yz,x 2 -y 2 } 
J|{3x 2 - 3^,-6^,0} 
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A. 2 Radial Grids 



This section contains the analytic forms of the three types of radial grids used in this thesis. 
Each of the grids has N points and the index i runs from to N — 1. The most basic grid 
is the linear grid. The logarithmic grids are desirable for applications that require a large 
number of points near the origin, such as atoms with the —Z/r Coulomb potential. The 
initial point of the logarithmic grid r$ cannot be zero and is typically chosen to be a very 
small number. In contrast the initial point of the zero based logarithmic grid is zero. 

A. The linear grid: 

B. The logarithmic grid: 




(A.16) 



C. The zero based logarithmic grid: 

r t = B (exp(Az) - 1) (A. 17) 



A. 3 One-Dimensional Cubic Splines 

Given a function defined on a grid, {xi},i = 1 . . . N, such that y\ = y(xi), we would like 
to evaluate the function (interpolate) at a point x in the interval [xj,Xj + i]. The linear 
interpolation formula 

y = A yj + By j+1 (A. 18) 

where 

A — X ^2 X B = 1 - A = X - , (A.19) 

30 j _|_ ~^ 3j j j ~\~ 1 j 
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satisfies the boundary conditions at the endpoints: y(xj) = yj and y{xj + \) = yj+i- The 
problem with this approach is that over the entire range of the function, [x\, xjy], the result of 
using Eq. (A. 18) would be a series of piecewise linear functions with a zero second derivative 
within each of the intervals and an undefined or infinite second derivative at the interval 
boundaries, i.e. the grid points {xi}. Ideally we would like to construct an interpolation 
function with a smooth first derivative and a continuous second derivative, both within the 
intervals and at the the grid points. 

By adding a cubic polynomial to the linear interpolation equation, Eq. A. 18, within each 
interval, it is possible to construct an interpolation function that varies linearly in the second 
derivative. Assume for a moment that we know the values of the second derivative at each 
grid point: y" = d 2 y(xi)/dx 2 , i = 1 . . . N. We can construct a cubic polynomial that not only 
has the correct second derivatives at the end points of each interval, y" and y'j + x, but also 
evaluates to zero at those same endpoints. The reason the cubic polynomial must be zero 
at the endpoints is to not spoil the agreement that is already built into the linear function. 
A function constructed from these principles is given by 

y = Ay, + By j+1 + Cy] + Dy» (A.20) 



where 



C = ^(A 3 - A)(x j+1 - x,) 2 D = 1 -{B 3 - B)(x j+1 - x 3 ) 2 . (A.21) 



To explicitly check that this function does indeed satisfy the conditions at the endpoints, 
take the derivatives 

dy Vj+i-Vj 3A 2 -1 3B 2 -1 . dq\ 

ax x j+ i — Xj 6 J 6 J 
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and 

dx 2 

The second derivative is continuous across the boundary between two intervals, i.e. [xj-i, Xj 



^ = Ay» + By» +1 . (A.23) 



and and obeys the conditions at the endpoints since at x — Xj, (A = 1,B = 0) 

and at x = Xj + ±, (A = 0, B = 1). 

We had made the assumption that the values of the second derivative are known at the 
grid points, which of course they are not. By imposing the condition that the first derivative 
is smooth and continuous across the boundary between two intervals, it is possible to derive 
a set of equations to generate the y"' s. Evaluate Eq. A. 22 at x = Xj for the interval [xj-±, Xj], 
and set it equal to the same equation evaluated at x = Xj in the interval [xj, Xj+i]; rearranging 
the terms we arrive at 



Xj Xj+i ,i _|_ Xj+i Xj-i it _|_ x j+i x j ii _ Vj+i Vj _ Uj Vj+i 
6 3 6 ^ Xj-\-\ — Xj Xj — Xj.\-\ 



where j — 2 . . . N — 1. To generate a unique solution for the system of iV — 2 equations, we 
have to impose boundary conditions at X\ and x^. The possibilities being either to set y'[ 
and y" N to zero, the natural cubic spline, or, if you want to make the first derivative at the 
boundaries to have a specified value, use y[ and y' N to calculate the second derivatives at 
the endpoints using equation. For an excellent discussion of one-dimensional cubic splines 
see Ref. [189]. 

A. 4 The Runge-Kutta Algorithm 

The second order Runge-Kutta method [257] can be used to solve first order differential 
equations of the form 

t = '<*>■ < A - 25 > 
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where the solution y(x) is the integral of f{x). The integral equation equivalent to Eq. (A. 25) 



is 



y(x) = / dx' f(x') +y . 



(A.26) 



To derive the second order Runge-Kutta method let y{x) and f(x) be discrete functions on 
a uniform grid x of spacing h : x n = xq + nh(n = 0, 1, 2, . . .). Taylor expand y(x) around x n 
to yield 

1 _ 1 _ 

(A.27) 



y n+1 = y n + hy' n + ^h 2 y'i + ^h 3 y' r '! + 0[h A ] 



which by using Eq. (A. 25) becomes 



2/n+l = Vn + hf n + ^h 2 f n + ^tffl + 0[/l 4 ] 



(A.28) 



Using the formula for the first derivative 



f 



fn+l fn + ^ 2 j 



(A.29) 



Eq. (A.28) can be written, up to a third order approximation, as 



y n +i = y n + h 



+ 0[h s 



(A.30) 



The algorithm generates the sequence y n , with the boundary condition y(xo) = yo, according 
to the following rules: 



h = hf n 
^2 = hf n+ i 



(A.31) 
(A.32) 
(A.33) 
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A. 5 The Numerov Algorithm 

The Numerov algorithm [257] is used for integrating second-order differential equations of 
the form 

Pi + k 2 (x)y = S(x). (A.34) 



To derive the Numerov algorithm Taylor expand the function y(x) around x — 0, 

y(x) = y + xy' + ^x 2 y" + ^x 3 y"' + G[x% (A.35) 

Let y(x) be a discrete function on a uniform grid x of spacing h : x n = Xo+nh(n = 0, 1, 2, . . .). 
Now it is possible to write y n ±i in terms of y n and the derivatives of y(x) evaluated at x n , 

y n ±i = y(x = x n ±h) = y n ± hy' n + ±h 2 y: ± ±h*tf + i/i 4 <" ± ^h 5 y^ + 0[h ^ {AM) 
The equation for the second derivative of y(x) at x = xq + nh follows by adding y n +i and 

Vn-l , 

(y n +i-2y n + y n -i) = y „ + 1^,,, + (A 3?) 

the 0[h 2 ] error has been explicitly written for reasons that will become clear shortly. From 
the differential equation in Eq. (A.34), it is possible to derive 

y'n = £ (-*"»+ a) l_ (a.38) 



dx 2 

(k 2 y) n +i - 2{k 2 y) n + {k 2 y) n . 



h? 
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Now substitute Eq. (A. 37) into Eq. (A. 39) and rearrange the terms to produce the Numerov 
algorithm: 

h 2 5h 2 h 2 

(1 + ^^+2)^+2 - 2(1 - — *hi)Vn+i + (1 + ^k 2 n )Vn = 

^(S n+2 + 10S n+1 + S n ) + 0[h% (A.39) 
The necessary initial conditions are for the first two values of y : 

y(xo) = Vo and y(xi) = y x , (A.40) 

to start the integration. 

A. 5.1 Application to the Radial Schrodinger Equation and the 
Need for Logarithmic Grids 

The radial Schrodinger equation 

1 d 2 1(1+1) 



V(r) 



2 dr 2 2r 2 

can be represented in a form in Eq. (A. 34) with S(r) = and 



u n i(r) = e n iu nl (r). (A.41) 



k 2 (r) = 2[e nl - l -^±^--V(r)}. (A.42) 



Eq. (A.41) can be solved for bound states (e n i < 0) with the boundary conditions 



limu n z(r) — ► r(l — Zr), 1 = 

r— >0 

\imu n i(r) — > r l+ \ l>0 (A.43) 

r— >0 
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and u n i(r) — ► for r — > oo. For the case of a bare-core atomic potential V(r) = —Z/r, 
which has a strong cusp at the origin, it is necessary to use a grid with a high density of 
points near the origin. For this reason we use a logarithmic grid y = log(r), which for a 
given number of grid points iV has a much higher concentration of points than the linear grid 
near the origin. Since the Numerov algorithm only applies to uniform grids it is necessary 
to make the following change of variables 



y = log(r) and Z nl (r) = ^p- (A.44) 



which leads to the modified Schrodinger equation 

+ k' 2 (r)Z nl (y) = 0, (A.45) 



d 2 Z n i(y) 



dy 

with 

k' 2 (r) = 1r 2 \e„ - ^ 

2r 



k' 2 {r) = 2r 2 [e nl - - V(r)}, (A.46) 



and the boundary conditions 



limZ ni (r) — ► y/r(l-Zr), I 



r->0 

limZ ni (r) — > r l+ ^, I > 0. (A.47) 
i — >o 

To solve the Numerov algorithm it is necessary to make an initial guess for the eigenvalue 
e° z , where the subscript represents the zeroth iteration. It is necessary to iterate until 
the difference between e l nl — e^J 1 has reached a specified tolerance, usually around (10~ 9 — 
10~ 12 ). For the solution to be acceptable it must have the correct number of nodes and the 
boundary condition at r = oo is satisfied. The eigenvalue is adjusted for each iteration by 
specifying a lower and upper bound, with the eigenvalue chosen to be the average. It is also 
necessary to determine the classical turning point, where k' 2 (r) changes sign, all of the nodes 
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of the solution must be inside this radius. In the classically forbidden region the solution 
exponentially decays. The boundary condition near the origin are utilized to determine i/q 
and yi. the Numerov algorithm integrates outward until the solution exponentially grows 
in the classically forbidden region or crosses the axis to create an extra node. For the case 
where the integration produces too many (few) nodes, lower (raise) the eigenvalue; for a 
divergence in the classically forbidden region raise the eigenvalue. The eigenvalue can either 
be raised (lowered) by raising (lowering) the lower (upper) bound of the eigenvalue. 

A. 6 Derivation of the Diffusion Part of the Green's 
Function 

The following derivation is given for a one-electron system, but the results are generalized to 
the iV-electron system. The Green's function for a one-electron system with the Hamiltonian 
[H = T = — V 2 /2] is given by the expression 



Insert two realizations of the unity operator in the momentum space representation 



and use the fact that the kinetic energy operator T is diagonal in momentum space (k = 
— iV) to yield 



G D (r 



I \ / I —tT\ l\ 

r ; t) — (r|e |r ;. 




(A.48) 



r ; t = 




Now use the identity 



(r|k) 



1 



(A.49) 



(2tt) 3 / 2 
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to derive 



G D (r^r';r) = 7^ / dk e^V^i^ 



(27T) 

Using the integral relations 



(2nf 

1 POO /* 7T 

/ dA^V Tfe2 / 2 / ^sin(^)e ifc ' r " r '' cos W. 
Jo Jo 



77 



d6> sin(^)e acos W = 2sinh(Q) (A.50) 







and 

-2 







cir re ar sin(rfr) = — - 3/2 (A.51) 



we arrive at the final result 



The Green's function for the iV-electron system is the product of iV one-electron Green's 
functions: 

The Green's function for the closely related Fokker-Planck equation can be derived fol- 
lowing an almost identical procedure as was used for the diffusion equation. The Green's 
function for the Fokker-Planck equation is given by: 

G FP (r <- r'; r) = (r|e Ar[v2 - v ' F - F - vl |r / ). (A.52) 

As a first step insert two realizations of the unity operator in the momentum space repre- 
sentation to yield 

G FP {r <- r'; r) = J dk (r|k)e- Ar[fc2+2ik - F] (k|r'). (A.53) 
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It is assumed that the function F is constant over the short time interval r. Use the identity 
in Eq. A.49 

G FP (r<-r';T) = — ^ [ dk e ik-(r-r') e -Ar[^+2ik.F] 

(2tt) j J 

= fdkk 2 e~ XTk2 I dQ sin^ fc l r - r '- 2ArF l cose , (A.54) 

and the integral relations, Eqs. A. 50 and A. 51, to arrive at 

Gfp{1 *" r ' ; T) = (4^ e " ( '""" 2jTr)2/liT - (A - 55) 
The Green's function for the A— electron system is: 

G FP (R .- E'i r) = ^j-^-l-"'^/* (A.56) 

A. 7 The Cusp Conditions 
A. 7.1 The Nuclear Cusp Condition 

The nuclear cusp condition is the direct result of the divergence of the potential V(r) = —Z/r 
at the origin, i.e. r — > 0. To derive the nuclear cusp condition begin with the Schrodinger 
equation for a Hydrogen-like system of a single electron and a nucleus of charge Z : in atomic 
units this is given by 

V>(r) = E^ir). (A.57) 



1 . Z 
--V 2 

2 r 



The divergence of the potential energy as the electron approaches the nucleus is canceled by 
a divergence in the kinetic energy, resulting in the energy E remaining finite. 

As a next step decompose ip{r) into the product of a radial function and a spherical 
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harmonic, ip(r) = R(r)Yi m (8 , <fi) , and substitute into Eq. (A. 57) to yield the radial equation 

d 2 R 2dR 2Z 1(1 + 1) fk , 

77 + -T- + — fl- 1 , J R + 2ER = 0. A.58 

ar z r dr r r z 

Let -R(r) = r m p(r), where p(r) is a function that is finite in the limit r — > 0, with the first and 
second derivatives R'(r) = mr m ~ l p + r m p' and R"{r) = m(m — l)r m ~ 2 p + 2mr m ~ 1 p' + r m p", 
respectively. The terms containing r~ 2 cancel for m — I, reducing Eq. (A.58) to 

p" 2(1 + 1) pf 2Z , kS 
— + V - + — + 2£ = 0. A.59 

p r p r 

Canceling the divergences in the r~ l terms leads to the desired electron-nuclear cusp condi- 
tion 

"' =-jfj. (A.60) 



P 

In the r — > limit it is possible to approximate the radial part of ip(r) by R l (r) pa 
r l exp(-Zr/{l + l)) [7]. 

A. 7. 2 The Electron-Electron Cusp Condition 

To derive the electron-electron cusp condition begin with the Hamiltonian for a two-electron 
system 

which can be written in terms of the relative r = ri — r2 and center-of-mass coordinates 
R= ( ri + r 2 )/2 

H = -V 2 r -\v 2 R + 1 -. (A.62) 
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Following an almost identical process that was used in deriving the electron-nuclear cusp 
condition, the Schrodinger equation for the relative coordinate is given by the expression 



d 2 R 2dR 1 

dr 2 r dr r 



1(1 + 1] 



R + ER = 0. 



(A.63) 



Canceling the r 2 terms leads to 



P" , 2(l + l) p' 
p r p 



- + E = 

r 



(A.64) 



which yields the cusp condition 



P_ 

P 



(A.65) 



P =o 2(1 + 1)" 

For the singlet (S = 0) case, the electrons are in a s (I = 0) like state. This is due to the 
fact that fermions obey the anti-symmetry condition for particle exchange Eq. (6.1) - if the 
spin part of the wave function is in an anti-symmetric singlet state, the position space part 
must be symmetric. For the triplet (S — 1) case, the electrons are in a p (I — 1) like state. 
These cases can be summarized by the relation 



P_ 

P 



r=0 



5 for opposite spins 



4 for parallel spins 



(A.66) 
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A. 8 Proof of the Relation Between the 



Norm- Conservation Condition and the Energy 
Derivative of the Logarithmic Derivative 



To prove the relationship in Eq. (7.8), restated here as 



-2vr 



{rip)' 



d_d_ 

de dr 



2.1.2 



An dr r ip 



(A.67) 



we follow procedure similar to that of Shirley et al. [205]. It is necessary to begin with 
the single electron, non-relativistic radial Schrodinger equation for a spherically symmetric 
potential (in atomic units) 



Id 2 



2 dr 2 



+ 



1(1 + 1) 
2r 2 



V-e 



0. 



(A.68) 



where <p = rip is the radial orbital and e is the energy corresponding to ip (not necessarily 
an eigenvalue). Define the variable X which is the radial logarithmic derivative of <j) : 



X(e,r) = ^- [ln0(e,r) 



(A.69) 



Differentiate Eq. (A.69) with respect to r 



X'(e,r) 



d 
dr 



(A.70) 



which leads to the relation 



[X'( e ,r)+X( e ,r) 2 ]=^-. 



(A.71) 
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By use of the radial Schrodinger equation, Eq. (A. 68), it is possible to substitute for (f>" to 
yield 

Z/7-u-n 

2[V-e\. (A. 72) 



X'(e,r)+X(e,r?= l -V±± 



Now differentiate with respect to e 



§-X'(e,r) + 2X(e,r)^-X(e,r) = -2, 



(A.73) 



which can be shown to equal 



d_ 
dr 



-20 2 



(A. 74) 



The final step is to integrate 



-2 / dr<f) 2 . 



(A.75) 



The above relation is equal to Eq. (A. 67) by substituting = rip and multiplying both sides 
by -2?r. 

A. 9 Input File for Gaussian 03 

This is an example input file for the Ge2H 6 molecule for the Gaussian 03 package [11]. This 
input file includes the Ge and H atomic basis sets and the Gaussian representation of the 
HF pseudopotential from Ref. [12]. 

°/„mem=2000000 
/„chk=Ge2H6a.chk 



#P HF Gen Pseudo=Read pop=regular units=au 
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Ge2H6 relativistic pseudopotential 



1 

Ge 0.000000000000 0.000000000000 2.27050594990 
Ge 0.000000000000 0.000000000000 -2.27050594990 
H 0.000000000000 2.690399997713 3.38490611634 
H -2.329954744198 -1.345199998856 3.38490611634 
H 2.329954744198 -1.345199998856 3.38490611634 
H 0.000000000000 -2.690399997713 -3.38490611634 
H -2.329954744198 1.345199998856 -3.38490611634 
H 2.329954744198 1.345199998856 -3.38490611634 



Ge 

SP 1 1.0 

1.8893 1.0 1.0 
SP 1 1.0 

1.2528 1.0 1.0 
SP 1 1.0 

. 3473 1.0 1.0 
SP 1 1.0 

0.1199 1.0 1.0 
D 1 1.0 

0.3225 1.0 1.0 
**** 

H 

S 10 1.0 
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5.194042250000e+02 
1.164538900000e+02 
5.929693250000e+01 
3.038276170000e+01 
1.527814390000e+01 
5.228406600000e+00 
2.011882860000e+00 
8.278748700000e-01 
3.528963300000e-01 
1.519936310000e-01 

S 10 1.0 
2.057217150000e+02 
4.612417980000e+01 
2.348588260000e+01 
1.203377550000e+01 
6.051252190000e+00 
2.070827920000e+00 
7.968514160000e-01 
3.278984450000e-01 
1.397725210000e-01 
6.020049310000e-02 

SI 1.0 
1.243000000000e-01 

PI 1.0 
1.083800000000e+00 



5.993969000000e-04 
1.182210200000e-03 
4.463008800000e-03 
1.965309700000e-03 
2.432256750000e-02 
7.729528480000e-02 
1.919010140000e-01 
3.582211210000e-01 
3.734942420000e-01 
1.114260400000e-01 

5.993969000000e-04 
1.182210200000e-03 
4.463008800000e-03 
1.965309700000e-03 
2.432256750000e-02 
7.729528480000e-02 
1.919010140000e-01 
3.582211210000e-01 
3.734942420000e-01 
1.114260400000e-01 

1.000000000000e+00 

1.000000000000e+00 
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Ge 

DFPP 2 28 
Vd 

8 

1 

2 

2 

3 

3 

4 

4 

4 

Vs - Vd 
8 
2 
2 
3 
3 
3 
4 
4 
4 

Vp - Vd 

8 
2 
2 
3 



1.61064071 
2.09933155 
0.52560138 
0.96723061 
1.02293234 
1.46268678 
2.74284856 
2.07410625 



4.00000000 
-1.98792973 
-1.11113943 
17.12776241 
-10.68519957 
-9.96240393 
-4.17742874 

9.38249264 



3.72786591 
4.84415098 
3.47177005 
5.17417165 
5.86299536 
1.25319886 
4.34019107 
1.48777601 



3030.66009545 
-3025.78448986 
-145.29433187 
476.96147327 
-331.66714140 
-5.57011669 
-3355.75676252 
1.86441160 



0.54761787 
6.19779895 
2.26197595 



0.89906275 
0.92815207 
-19.85908121 
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3 



1.51819178 



74.08955303 



3 



1.24770700 



54.23047181 



4 



1.67705941 



27.51590356 



4 



2.53042379 



81.64533076 



4 



2.93857052 



60.37426995 



A. 10 Data for Germanium 

This section outlines the procedure to spin-orbit correct the spectral data for Ge. 

A. Start with the spectral data from Ref. [15]. Only the energy levels relevant to com- 
puting the excitation (removal and promotion) energies are included, see Table A. 3. 

B. Add 63713.24 cm" 1 to all energies Ge II; add (63713.24 + 128521.3) cm" 1 to all energies 
Ge III; add (63713.24 + 128521.3 + 274693) cm" 1 to all energies Ge IV. This creates 
an overall level scheme, see Table A. 4. 

C. Reference all levels relative to a level that does not have a spin-orbit splitting because 
either the total L or S is zero. A suitable choice is the 4s 2 level, see Table A. 5. 

D. Perform spin-orbit averaging. Note that the averages are computed as J weighted 
(2 J + 1) averages, see Table A. 6. 

E. The atomic excitation energies are computed as energy differences. The conversion 
factor is 8065.544 ev/cm [91]. 
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(a) Energy levels of neutral Ge (Ge I). 



Configuration 


Term 


J 


Level(cm 1 ) 


As 2 Ap 2 


3p 





0.0000 






1 


557.1341 






2 


1409.9609 


AsAp 3 


5 S 


2 


41926.726 


Ge II ( 2 P 1/2 ) 


Limit 




63713.24 


(b) Energy levels of singly 


ioni2 


;ed Ge (Ge II). 


Configuration 


Term 


J 


Leve^cm" 1 ) 


As 2 Ap 


2p 


1/2 


0.000 






3/2 


1767.357 


AsAp 2 


4p 


1/2 


51575.885 






3/2 


52290.942 






5/2 


53366.738 


Ge III (%) 


Limit 




128521.3 


(c) Energy levels of doubly 


ioniz 


;ed Ge (Ge III). 


Configuration 


Term 


J 


Level(cm _1 ) 


4s 2 







0.0000 


Ge IV ( 2 S 1/2 ) 


Limit 




274693 


(d) Energy levels of triply 


ioniz* 


3d Ge (Ge IV). 


Configuration 


Term 


J 


Level(cm _1 ) 


4s 


2 S 


1/2 


0.0000 


Ge V ( l S ) 


Limit 




368720 



Table A. 3: The energy levels of atomic Ge. 



A. 11 The Local Pseudopotential for Germanium 
A. 11.1 Introduction 

Previous QMC work on Ge have suggested that a local potential is a good approximation 
[49, 50, 51]. Evaluating a local potential is much less computationally expensive in QMC than 
evaluating non-local potential. In addition, for a local potential the potential energy depends 
on the relative positions of the ions and electrons and the functional form of the potential, 
in contrast the potential energy of a non-local potential explicitly depends on the trial wave 
function (see Eq. (7.20)). As discussed in Section 7.6.3, using a non-local pseudopotential 
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Configuration 


Term 


J 


Level(cm 1 ) 


As 2 Ap 2 


3p 





0.0000 






1 


557.1341 






2 


1409.9609 


AsAp 6 


5 S 


2 


41926.726 


As 2 Ap 


2p 


1/2 


63713.24 






3/2 


65480.597 


AsAp 2 


4p 


1/2 


115289.125 






3/2 


116004.182 






5/2 


117079.978 


As 2 


1 5 





192234.54 


As 


2 S 


1/2 


466927.54 


ble A. 4: Overall level scheme of energy levf 


Configuration 


Term 


J 


Level (cm l ) 


As 2 Ap 2 


3p 





-192234.54 






1 


-191677.4059 






2 


-190824.5791 


AsAp 3 


5 S 


2 


-150307.814 


As 2 Ap 


2p 


1/2 


-128521.294 






3/2 


-126753.943 


AsAp 2 


4p 


1/2 


-76945.415 






3/2 


-76230.358 






5/2 


-75154.562 


As 2 







0.000 


As 


2 S 


1/2 


274693 



Table A. 5: Energy levels relative to 4s 2 . 



in DMC introduces another layer of approximations and violates the variational principle. 
For these reasons it is always desirable to avoid using non-local pseudopotentials in QMC 
whenever possible. Therefore, as part of the work for this thesis a local pseudopotential was 
generated for Ge. 



A. 11. 2 The Generation of the Pseudopotential 

All of the HF results in this section are from the numerical HF code discussed in Chap- 
ter 2. A modification to this code was used to create the local potential. The advantage 
to using this code as opposed to the Gaussian 03 package [11] is that there are no basis 
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Configuration 


Term 


Level (cm 1 ) 


As 2 Ap 2 


3p 


-191265.517 


4s4p 3 


5 S 


-150307.814 


4s 2 4p 


2p 


-127343.06 


4s4p 2 


4p 


-75811.636 


4s 2 


l s 


0.000 


4s 


2 S 


274693 



Table A. 6: Spin-orbit averaged energy levels. 



set errors and it is possible to explicitly occupy individual orbitals with quantum num- 
bers (n, l,m, s). All of the atomic configurations used in this study are occupied according 
to Hund's rules. 1 For example, the ( 3 P) ground state of Ge is created by occupying 
the (4, 0, 0, |), (4, 0, 0, j), (4, 1, 1, |), (4, 1, 0, |) atomic orbitals: the ( 5 F) excited state is cre- 
ated by occupying the (4, 0, 0, f), (4, 1, 1, T), (4, 1, 0, |), (4, 2, 2, f) atomic oribtials. 
The specific form of the local potential is the Starkloff-Joannopoulos [52] form 

Z (l - e ~ ar ) 

V sAr) = - 7 (l + e «J)) - (A76) 

where Z = 4 for Ge and a and r c are adjustable parameters. This potential was found to be 
a reasonable approximation for Ge in a study by Rajagopal et al. [49, 50]. Test using LDA 
indicated that this potential was significantly inferior to fully norm- conserving non-local 
potentials but of similar quality to the best pseudo-Hamiltonian [50]. 

The pseudopotential was constructed to be a norm-conserving pseudopotential. The 
fitting procedure closely follows the HSC criteria in Section 7.2.1: at the matching radius 
the partial-norms, eigenvalues and radial logarithmic derivatives of the pseudo orbitals are 
set to be as closely equal to the corresponding results for the all-electron orbitals. Since 
the potential is local and is simultaneously fit for the s and p channels, it is difficult to 
match the quantities with the same level of accuracy as a non-local potential, which have 
much more variational freedom. The best fit was obtained for the matching radius of 1.7 



1 For an excellent reference see Ref. [2581 
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(nl) Value 


Pseudo HF 


All-Electron HF 


4s 


-0.54972 


-0.56670 


4p 


-0.28710 


-0.28614 



Table A. 7: Results from the generation of the local potential: comparison of the eigenvalues 
from pseudo Hartree-Fock (HF) and all-electron HF. 



(nl) Value 


Local 


Non-Local 


4s 


-0.54972 


-0.56657 


4p 


-0.28710 


-0.28629 


Total Energy 


-3.63041 


-3.66673 



Table A. 8: Comparison of the eigenvalues from the local pseudopotential to the non-local 
pseudopotential discussed in Section 7.4 

Bohr, the optimal values of the parameters are a and r c . Table A. 7 summarizes the results 
for the fitting of the pseudopotential. Table A. 8 compares the results for the eigenvalues 
ant total energy of the local potential to the non-local potential of Ref. [12, 209]. The 
non-local pseudopotential (see Section 7.4 for a detailed discussion) is used for almost all of 
the calculations presented in this thesis. The local potential underbinds the 4s and slightly 
overbinds the Ap electrons relative to the non-local potential. 

A. 11. 3 Results for Atomic Excitations 

We have performed an almost identical set of calculations for the atomic excitations of Ge 
for the local potential as we did for the non-local potential in Chapter 8. Table A. 10 presents 
the results for the first four ionization energies, the s removal energy, the s — > p promotion 
energy and the s — > d promotion energy. We have also included GW results of Shirley [17] 
wherever available. The results for the local potential, with the exception of the first two 
ionization energies, are not as accurate as the results for the non-local potential. This is 
especially true for the third ionization energy and the s removal energy and s —>■ p promotion 
energies. 

Table A. 11 contains the total energies for the different atomic configurations of Ge. We 
have included the results from HF and both VMC and DMC, with and without the CPP. It is 
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4 I i i i I i I i I i I i I i I i I i I i I 

0123456789 10 

Figure A.l: The local Ge pseudopotential. All units in a.u. 
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reassuring to note that the size of the CPP estimators for the various atomic configurations 
are almost identical. Therefore, it is not surprising that the size of the CPP corrections for 
the atomic excitations are also almost identical. 

Table A. 9: The CPP estimator (H-cpp) of various atomic configurations of Ge. All energies 
in eV. 



No. of electrons Configuration 


(Hcpp) 
VMC DMC 


4 4:sHp 2 ( 3 P) 
AsAp 3 ( 5 S) 


-1.74153(21) -1.76390(65) 
-1.36199(27) -1.37570(49) 


3 As 2 Ap( 2 P) 
AsAp 2 ( 4 P) 


-1.69701(33) -1.72689(65) 
-1.28790(23) -1.31023(49) 


2 4s 2 { 1 S) 


-1.69059(30) -1.72757(82) 


1 4s( 2 S) 


-1.10013(14) -1.11483(41) 
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Table A. 10: For the Ge atom results for the s — > p promotion energy, the s removal energy 
(s~), the first four ionization energies and the total energy (TE) of the pseudoatom, with and 
without the CPP. All excitation energies are total energy differences. The CPP correction, 
ACPP, is calculated as the energy difference of the results with and without the CPP. 
Experimental results from Ref. [15] have been properly spin-orbit corrected [16]: the J 
weighted averaging has been performed with respect to the 4s 2 ( l S) configuration [20]. All 
energies in eV and the QMC error bars are smaller 0.01 eV. 





s — > p 

s y(3 P) ^ 
sp 3 ( 5 S) 


s 

s 2 p 2 ( 3 P) -> 
sp 2 ( 4 P) 


s 2 p 2 ( 3 P) 
sp 2 d( 5 F) 


Expt. 


5.08 


14.31 




DMC+CPP 


4.71 


13.94 


11.98 


DMC 


4.35 


13.52 


11.55 


ACPP 


0.36 


0.42 


0.43 


VMC+CPP 


4.55 


13.76 


11.84 


VMC 


4.16 


13.30 


11.36 


ACPP 


0.39 


0.46 


0.48 


HF 


2.94 


11.78 


9.85 





1st IP 


2nd IP 


3rd IP 


4th IP 






s 2 p 2 { 3 P) 


s 2 pi 2 P) -> 


s 2 ( 1 S) 


s( 2 S) - 


TE of 




s 2 p( 2 P) 


s 2 ( 1 S) 


s( 2 S) 


[Ar}3d l0 ( l S) 


s 2 p 2 ( 3 P) 


Expt. 


7.93 


15.79 


34.06 


45.72 


103.50 


DMC+CPP 


7.91 


15.68 


33.57 


45.53 


102.69 


DMC 


7.90 


15.68 


32.98 


44.41 


100.97 


ACPP 


0.01 


0.00 


0.59 


1.12 


1.72 


VMC+CPP 


7.84 


15.59 


33.52 


45.53 


102.48 


VMC 


7.78 


15.59 


32.95 


44.41 


100.73 


ACPP 


0.06 


0.00 


0.57 


1.12 


1.75 


HF 


7.43 


15.16 


31.78 


44.41 


98.78 
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HF 



VMC 



VMC+CPP 



DMC 



DMC+CPP 



(4s) 2 (4p) 2 

(4s) 2 (4p) 

(4s) (4p) 3 

(4s) (4p) 2 

(4s)(4p) 2 (4ci) 

(4s) 2 

(4s) 



-98.78840 
-91.35561 
-95.84851 
-87.00542 
-88.94278 
-76.19153 
-44.40898 



-100.72243(38) 

-92.94555(27) 

-96.56332(23) 

-87.42627(13) 

-89.36128(20) 

-77.35970(25) 

-44.40898 



-102.47892(65) 

-94.64231(49) 

-97.92452(35) 

-88.71907(27) 

-90.63761(41) 

-79.05037(30) 

-45.52550 



-100.9702(14) 

-93.0715(10) 

-96.62191(65) 

-87.45009(38) 

-89.41582(63) 

-77.39284(57) 

-44.40898 



-102.6898(13) 

-94.7775(11) 

-97.98107(63) 

-88.75338(52) 

-90.70939(87) 

-79.09763(73) 

-45.52550 



Table A. 11: Hartree-Fock (HF) and QMC total energies for various atomic configurations of 
Ge. For the (4s) configuration, we did not perform QMC since the numerical Hartree-Fock 
results are exact for a single electron in a central potential. All energies in eV. 
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